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a centre for the publication or other form of distribution of information concerning 


National Eugenics; (iii) as a school for training and assisting research-workers in 
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ON THE SKULL AND PORTRAITS OF GEORGE BUCHANAN. 
By KARL PEARSON, F.R.S.* 


I ExPEcT that many of those present will consider it an act of great rashness 
that a southerner should venture into the metropolis of northern Britain and dare, 
a mere scientist, to talk about a man whom many Scots look upon as Scotland’s 
greatest scholar, leading historian and the first writer in Scotland to propound a 
political philosophy. 

But I would remind you that for good or ill we cannot separate the history 
of our two countries. Buchanan played a large part—a larger part than some of us 
recognise—in those still obscure proceedings which, starting at Langside, were 
continued at York and Hampton Court, and culminated only in the final scene 
at Fotheringay. Mary Stewart’s history is as much English as Scottish history, and 
the story of both countries would not only have been different, but differently 
written, had the Tudor blood been as dominant in Mary as in Elizabeth; while 
the psychological bearing of the extent to which Mary did inherit Tudor charac- 
teristics has been often overlooked by the historians of both countries. 


The time has come when we are justified in taking, not a heptarchical view 
of the history of Britain, but a British view, which would recognise that local 
histories can be interwoven into a wider history of our united countries. Queen 
Mary, George Buchanan, Darnley, Moray, Bothwell, and the rest of the Scottish 
actors of those days really played their parts on a wider stage than local history 
provided ; that is not only the source of their perennial interest, but the excuse 
for my appearance and choice of a subject here this afternoon. 

Previous lecturers for the Henderson Trust have told us that while much of 
the old phrenology is no longer acceptable in the light of modern science, yet 
increasing acquaintance with the function of brain centres may lead us, not so 
much from observations on the external surface of the skull as from the markings 
on the endocranial cast, to suggestions—not to use too emphatic a word—as to 
character and abilities. I should like at once to state that this is not to be the 


line of my inquiry this afternoon. I think an endocranial cast of George Buchanan’s. 


skull would be of interest, but I have not ventured to take it nor to ask for it to 
be taken. 


I want to put before you something much less abstruse, and more direct in its 
appeal to your imaginations. 

In the rough and tumble of everyday experience, in trusting to our fellow 
men, in the appointment of colleagues, assistants and servants, and even in the 


* Being a lecture delivered on June 4 in the Anatomy Theatre of the University of Edinburgh for 
the William Ramsay Henderson Trust. 
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choice of acquaintances and friends, we are largely influenced by our reading of 
character from physiognomy. We may admit it or we may not, but it is a fact 
that facial expression draws our confidence or serves as a warning mark. The 
reading of physiognomy is not a science with its ascertained rules; it is not wholly, 
I believe, a result of past experience, but rather an instinct modified by experience. 
We are reduced to the sentiment of Martial, as englished by Tom Brown: 
“T do not love thee, Doctor Fell, 
The reason why I cannot tell ; 


But this alone I know full well, 
I do not love thee, Doctor Fell.” 


Even a Christchurch undergraduate is susceptible to facial expression! 

Probably in judging half-unconsciously by physiognomy we are right in nine 
cases out of ten, and badly wrong on the tenth occasion. We should do better, 
if we judged by the dynamic expression as given by the film of an individual 
rather than by the static expression conveyed by a bust or portrait. 

The reconstruction from the skull of the head is a branch of science, which has 
of recent years made some progress, but while the facial expression does to some 
extent depend on the form of the skull, and the strength of the musculature which 
can be partially judged from the skull markings, we are left wholly in the dark as 


to those facial lines which leave no permanent traces on the skull itself, and which 
have so much to do with our judgment of character. 


Why is it that we desire to see and to acquire portraits of great men, or of the 
individual men we treat as our personal heroes or saints? I think it undoubtedly 
arises from the fact that a portrait, at any rate a good portrait, helps us to a truer 
characterisation, a truer appreciation, than many pages of verbal description. We 
are thrown back on our instinctive judgment of physiognomy. I believe that 
at present one of the chief services which can be done by aid of the skull of a 
great man is to find out from it which portrait, if portraits exist, truly represents 
him. With all due reverence I would exhume the skull of Shakespeare, of 
Newton or of Milton, and then we should be able to standardise their portraiture, 
and say: Here is the true man, and what would also be of interest: There is 
the artistic liar! 


Something of this kind has been done in my Laboratory for two historic skulls— 
that of Sir Thomas Browne and that of Henry Stewart, Lord Darnley; and I may 


be permitted to indicate here how the method serves to differentiate good from 
bad portraiture. 


On Plate I we have a portrait of Jeremy Bentham fitted with the significant 
lines of his “ Auto-Icon.” The mummified head, when the drooping mandible is 
brought into touch with the upper jaw, gives a quite passable fit, showing that 
the artist was reasonably truthful. On Plate Il we have a contrast between two 
portraits of Sir Thomas Browne fitted with the outline of his skull adjusted as 
closely as possible to the two representations. The earlier or Allix portrait agrees 
well with the skull, while the later portrait, almost certainly painted after his 
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death for the Royal College of Physicians, exaggerates his low frontal into “high- 
brow” intellectuality. The artist was quite clearly an incorrigible idealiser. 

Can we apply the same method to obtain further light on George Buchanan as 
judged from his portraits? To attempt this will be the purpose of my lecture this 
afternoon. But before we proceed further with the portraits, it is wise to inquire 
what evidence we have for the authenticity of the skull. 

Buchanan died on September 28, 1582, aged 76 years. 


There is nothing in the skull to contradict this age. It is the calvarium of an 
old man; there is no mandible, which would have told us something additional, 
but the teeth in the upper jaw must have been present at death, although the 
alveolar border has extensively receded. 


The bony walls of the skull have been reduced in parts to the thinness almost 
of a railway ticket in a manner that is very characteristic of extreme age, and 
familiar to those who have handled large numbers of crania. The zygomatic 
arches, at least to judge from the one that is complete, could never have been 
massive and have dwindled with age almost to the thickness of a quill. The owner 
of the skull could hardly have had prominent cheek bones, except by extreme 
thinness of the flesh. 

Coming to the history of the skull itself, we naturally find its first purloming— 
as in the case of Sir Thomas Browne’s—involved in obscurity. It is the more so, 
perhaps, in this case, as the purloiner was a man of position in Edinburgh—no 
other than the Rev. John Adamson, Principal Regent of the University from 
1623 to his death in 1653. Adamson was a great admirer of Buchanan, but he was 
hardly likely to state, during his official life, how and when he had despoiled 
Buchanan’s grave. Puissant as we all recognise the principal of a university to be, 
humbly as we obey his dictates, yet we perceive that his autocracy—wide as it is— 
does not include body-snatching as a prerogative. There are some things which 
even the principal of a university cannot do publicly, but we may be fairly sure 
that when he does them sub rosa, he will carry them through successfully, and 
obtain exactly what he is seeking. I think we need not doubt that Adamson, 
having set his heart on Buchanan’s skull, was not likely to fail in getting the 
actual article. 

Adamson was educated at St Andrews, being born in the year 1576; he 
held I think a professorship of philosophy there, but in 1598 became a regent in 
Edinburgh, holding his chair there till 1604, when he accepted a ministerial call 
to North Berwick, being afterwards translated to Libberton and returning to 
Edinburgh as Principal in 1623. 

In 1598, George Buchanan had been buried for 16 years, in 1623 for 41 years. 
We must further note that Buchanan had died in poverty, directing the small sum 
of money he had left to be given to the poor, and telling his friends to let him lie 
where he was till the City of Edinburgh buried him. He appears to have been 
interred without a tombstone. Adamson in his epitaph or eulogy on Buchanan 
draws attention to this fact, saying that only those need monuments who have 
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nothing else to be honoured by. A grave without a tombstone is very likely to 
be encroached upon for later interments, and Adamson was undoubtedly familiar 
with the situation of Buchanan’s grave. Probably he obtained the skull from the 
grave-diggers at some later interment at or near Buchanan’s tomb. Sir Robert 
Sibbald, born twelve years before Adamson’s death, writing of Adamson within 
thirty years of the latter’s death, tells us, somewhat naively, that by his, Adamson’s, 
care the skull of Buchanan, dragged from its grave, has been preserved in the 
Library of the University of Edinburgh. There it remained, accompanied by a 
copy of Adamson’s Latin verses, to which I have already referred, till 1817, when 
it was transferred to the Museum of Anatomy. About 100 years after Buchanan’s 
death Mr Robert Henderson, University Librarian, refers to the skull, and says of 
Adamson: “It’s storied of him that being at Buchanan’s burial, he bargained for 
the skull; which having got from the graveman within the year or so, he care- 
fully kept in his life-time; and being found in his study so inscribed after his 
removal was handed down to us.” A child of six might be, if improbably, at the 
burial of Buchanan, but he certainly would not have bargained with the graveman 
for Buchanan’s skull on that occasion. However the main point of Henderson’s 
account, that the skull was found in Adamson’s study after his death, inscribed 
with the name of Buchanan, doubtless corresponds to the truth. And if so, Adamson 
would hardly have obtained the skull without the assistance of the sexton, or kept 
it without a fair amount of certainty that it came from George Buchanan’s tomb. 


It is noteworthy that while John Adamson says there was no tombstone to 
Buchanan’s grave, we read in 1701 of the existence of such a stone. 


“ At Edinburgh, the 3rd day of December 1701, the same day the council being 
informed, that the through stone [tombstone] of the deceast George Buchanan 
lyes sunk under the ground of the Greyfriars: therefore they appoint the chamber- 
lain to raise the same, and clear the inscription thereon; so that the same may be 
legible.” 

If we could ascertain at what date that stone was erected, we should have an 
upper limit to the date when the Principal of the University robbed the Grey- 
friars, for there was clearly no stone when he wrote his verses. 

I have not succeeded in finding any real work done on the skull in relation to 
the portraits. 

Mr William Carruthers in the Scottish Review for October 25, 1906, dismissed 
the skull, stating that it could not be that of George Buchanan, because it was not 
like the portrait in the National Portrait Gallery in London. The question seems 
to be: Is it like any other portrait ? if it is, it would only show that the National 
Portrait Gallery picture is not a truthful representation of Buchanan. 


Sir William Hamilton is said to have tested Buchanan’s portraits against the 
skull about 1835 by measurement and found that only the Chouet woodcut (see 
Plate III) fulfilled his tests. In other words he appears to have rejected the two 
portraits in the Senate Hall of his own University, which are of the National 
Gallery type. Speaking of the Chouet wood engraving and its reproduction by 
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Freiherus, Hamilton writes: “The head in both is thoroughly Scottish in character 
with a long and well-formed nose, well-defined cheek bones and a long upper lip 
as in the skull.” Now these three characters are all present in the National 
Portrait Gallery type (see Plate IV), but appear to be quite absent in the skull. 


I am not clear what a head “Scottish in character” may be. It is not possible 
craniometrically to distinguish really significant differences between the Lowland 
Scottish measured by Turner and the 17th century Londoners measured in my 
own Laboratory. Recently some 400 crania from a graveyard in London dating 
round the plague year 1665 have been measured by us and we have obtained the 
type contours on the average of some 140 male skulls. The upper lip and the 
nose of this typical skull from the pauper graveyard are identical in length with 
those of the Buchanan skull, which like these 17th century Londoners, had no 
specially marked zygomatic arches*. The main difference between the Buchanan 
skull and that of the 17th century Londoner lies in the very feeble development 
in the former of the occipital and cerebellar regions. 


I do not think we can place any weight on Sir William Hamilton’s rejection of 
all but the Boissard-Chouet engraving, especially as that engraving is in my opinion 
undoubtedly a copy of the Edinburgh or Bronckhorst type of portrait, a point 
which Sir William Hamilton seems to have overlooked. 


Lastly I may refer to Professor Hume Brown’s Life of George Buchanan, 
wherein he writes: “It may be added that what on good authority is supposed to 
be Buchanan’s skull is preserved in the Anatomical Museum of the University of 
Edinburgh. Its general outline is exactly like that of the best portraits, and by 


its dome-like shape and extreme tenuity it has all the marks of a high cerebral 
development.” 


The skull has certainly extreme tenuity, but why this should be a mark of 
“high cerebral development” instead of as usual the concomitant of old age, 
I do not understand. As to its dome-like character, its auricular height is just 
the average of our Londoners +, and the “dome,” such as it is, is produced in 
appearance by the marked absence of occipital development. Professor Hume 
Brown does not tell us the locus of the best portraits which the general outline 
of the skull is exactly like. The fact that Sir William Hamilton and Professor 
Hume Brown appear to reach opposite conclusions may, perhaps, justify a bio- 
metrician in endeavouring to advance further by more elaborate measurements. 


I have placed in the accompanying Table in parallel columns the usual series 
of cranial measurements taken in the Biometric Laboratory, for (i) the average 
male Londoner of the 17th century, (ii) the late 16th century George Buchanan, 
(iii) the 17th century author of the Religio Medici, and (iv) the early 19th century 


* Zygomatic Breadth: Buchanan’s Skull, 130 mm.; Farringdon Street Graveyard, 130°lmm. Facial 
Breadth : Buchanan’s Skull, 92:5 mm. (?); Farringdon Street Graveyard, 91°4 mm. (Moorfields Plague 
Pit, 93°9 mm.). 

+ Males : 17th century Londoners. Farringdon Street Poor Graveyard 110°0, Whitechapel Plague 


Pit 112'1, Moorfields Plague Pit 111:3. These are mean values, hence the Buchanan skull, 110-0, is 
simply mediocre, 
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Comparison of the Cranial Measurements of George Buchanan with those of other Skulls. 


Cranial Character 


Capacity : 

Length (Flowers) 

Length (Maximum).. 

Breadth (Parietal) ... 
Breadth (Minimum frontal) | 
Height (Basio-vertical) 
Height jue 
Height (Auricular) .. 

Skull Base 

Arc, Transverse (Apex) 

Are, Sagittal .. 

Are (Nasion to Bregma) 

Arc (Bregma to Lambda) ... 
Arc (Lambda to Opisthion) 
Chord (Lambda to Opisthion) 
Horizontal Circumference ... 
Alveolar Point to Nasal ait 
Facial Height sae 

Facial Breadth 
Bizygomatic Breadth 

Nasal Height, R. 
Nasal Height, L. 
Nasal Breadth 
Dacryal Subtense 
Dacryal Chord 
Dacryal Arc ,.. 
Simotic Subtense 
Simotic Chord 
Breadth, R. Orbit: 
Breadth, L. Orbit 
Height, Orbit 
Height, L. Orbit 
Palate Height 
Palate Breadth re 
Palate Length to base of Spine 
Profile Length 
Length, Foramen Magnum 
Breadth, Foramen Magnum 
Profile Angle... 

Angle at Nasion 

Angle at Basion a 

Angle at Alveolar Point... 
lst Cephalic Index (100 B/L) 
2nd Cephalic Index (100 H/Z) 
3rd Cephalic Index (100 B/#) 


Facial Index .. 

Nasal Index, R 

Nasal Index, L. 

Foraminal Index 

Dacryal Index 

Simotic Index 

lst Palate Index (B readth/Length) | 
2nd Palate Index 
Occipital Index ae 

Orbital Index, R. 

Orbital Index, 

External Orbital Width 


Compound Index (B- H)/ 


Deviation 
Buchanan sku 
Skallof gir Thomas | Jeremy | 

George Buchanan Century gs. 

Difference 
8.D. 
1360 (2?) 1509 1475 — °93 
171°3 190°0 183°5 —2°75 
174°0 194°6 186°0 — 2:29 
140°0 143°7 147°0 ‘41 
98°8 99°0 + °34 
130°6 (?) 122°0 + 
129°9 121°9 132°0 + 
111-0 102°6 116°0 “00 
100°9 (7) 102°7 102°0 + ‘18 
304°0 297-0 325°0 (?) — 
355°4 380°0 365°0 (?) —1°65 
119°5 29°0 130°0 —1°51 
124-0 123°0 120:0 (?) - ‘51 
111-9 (?Reconst.) 128°0 115°0 (22) -1:11 
91°3 (? Reconst.) 99°8 (2) -1:17 
503°0 149-0 540°0 (2) —1-70 
21°5 21°7 (#) 22°0 (2?) + °82 
69°6 73°1 (2) 73°0 (2) — °20 
92°5 (?) 90°9 97°0 (2?) + ‘18 
130°0 132°3 (2) — 21 
51°0 52°2 52°8 — 28 
51°7 §2°2 52°8 -00 
25 °6 (2) 22°6 26°7 + °40 
12°0 12°8 11°9 (2) — 
32°0 32°7 33:0 —1°15 
4°5 5:0 4°3 ‘09 
10°4 9°5 + °63 
45°8 43°7 45-2 (2) +2°24 
44°8 42°8 (?) +1-62 
35°9 36°3 36:0 (2) + ‘68 
36°8 36°7 36:0 (2) +1:09 
10°5 16°5 12°0(%) — 42 
36°8 37°5 (22) 35:0 (2) — *82 
38°7 (22) 49-0 (22) 47:0 —2°75 
89°5 (2) 98°3 (2) 89-0 (2) — °92 
32°6 (2? Reconst.) 346 (2) —1-40 
— 30°1 31°0(?) 
87°'8 84°-7 (2) 93°-0 (2) + °60 
60°°5 (2) 65°°4 58°-0 —1:01 
42°°3 (7) 42°°7 (?) 44°°5 — ‘05 
77°°2 (2) 71°°9 (2) 17° 5 +1:07 
80°5 73°8 79°0 +1-47 
75°1 (2) 62°7 70°9 +1:80 
; 107-2 (?) 117°8 111°4 — *40 
6°7 5°4 (2) 11-2 — 
| 75°2 (2) 80°4 75:3 (2) — °30 
47°5 50°2 43°3 50°6 + *63 
47°5 49°5 43°3 50°6 + °46 
83°4 85°7 85°6 
58°1 54°1 61°2 53°6 (2) + 
50°7 43°3 57°5 45:3 (2) + ‘57 
85°4 (22) 76°5 (22) 74:5 +1:75 
28°5 28°5 33°7 (77) 34:°3 (2) ‘00 
58:0 58°3 (?) 55°8 61°3 + ‘ll 
81°0 78°4 79°6 (2) — 
80°9 85°7 78°3 + °23 
98°5 98°0 100°4 + 


Aver 

3 
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143 

128 
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Jeremy Bentham. None of these skulls is in anything like perfect condition, and 
that of Jeremy Bentham is in the mummified head, so that many measurements 
are difficult to take upon it, and are marked in the table with queries and double 
queries. Sir Thomas Browne’s skull is in a more complete condition than 
Buchanan’s, but still owing to breakages certain measurements are doubtful. In 
the case of Buchanan, a portion of the left orbit, a considerable portion of the 
occipital bone and the whole of the foramen magnum have disappeared. There 
is no mandible, and accordingly I have not given the mandibular measurements 
for the other crania. In the case of Buchanan I have given measurements obtained 
by a reconstruction in plasticene of the base of the skull. I do not propose to 
enter fully into the methods adopted in reconstructing parts of a skull. They can 
be carried out with more or less exactness, when we know the correlations of its 
component bones. As an illustration I may take the reconstruction of the basion 
or anterior point of the foramen magnum. We have first to measure as far as is 
possible the capacity of the skull. After closing the orifices this was found by 
mustard seed packing to be 1360 cms.’, very small as compared with the average 
English, or either Sir Thomas Browne’s or Bentham’s skull. Now we have a 
formula for finding the capacity from maximum length, parietal breadth and 
auricular height. This formula gave a capacity of 1375 cms.*, quite as close to the 
capacity obtained by mustard seed as could be hoped for. Assuming Buchanan’s 
capacity as known, we can compute the height from the bregma to the basion 
because we have a second formula giving the capacity in terms of maximum 
length, parietal breadth and basio-bregmatic height. Proceeding in this way one 
is able to determine reasonably accurate positions for points on the missing part 
of the base of the skull, and fill in the portion broken away with plasticene. That 
is what is meant by the word “ Reconstruction” against certain measurements in 
the table. 


We have seen that Buchanan’s skull had a small capacity, and it is at once 
clear why it had. The height of the skull and its breadth are not small, they are 
simply mediocre, but the length is remarkably small. Sir Thomas Browne compen- 
sated his low frontal and small auricular height by a great length and an increased 
parietal breadth. But nothing of this kind occurs in Buchanan’s skull. It is as if 
the occipital region and the. cerebellum had been simply cut away. This can be 
shown at once by superposing the horizontal contour of Buchanan’s skull on the 
type or average contour of 140 male English skulls*. You will remark at once 
nearly complete correspondence in the frontal region, but a total absence of 
occipital development in Buchanan’s skull. You may see the same point in the 
other two main contours. In the transverse or vertico-auricular section Buchanan 
is very nearly simple British, he might easily have been a 17th century Londoner ; 
in the sagittal section he is also mediocre British until we come to the occipital, 
and here comparison is not feasible, because in his skull that region may be said 
to fail entirely. 


* Tissues of the average Farringdon Street contours accompany Miss Beatrix Hooke’s paper in the 
preceding part of this journal and are reproduced with this part. 
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Now let us examine the characteristics of Buchanan’s skull by another method, 
namely by taking the difference between the character in Buchanan’s case and 
the corresponding average value in English crania, and dividing this difference by 


the average variability of the character in the English population. When this 
ratio is: 


1, 1 person in 6 has the characteristic more emphasised. 

25,onlyl ,, 100 


» » 


I think a characteristic can hardly be spoken of as individually remarkable 
until it occurs only once in at least 50 or more individuals. Values of 1 to 2 may 
be said to contribute somewhat to individuality, but not to be markedly personal 
features. Now let us look down the fifth column of our Table on p, 238 and see 
the points in which the owner of this skull possessed individuality. We may 
examine first certain points in which he certainly had no individuality. I refer 
to those mentioned by Sir William Hamilton. George Buchanan had certainly 
not a long nose, it was just the nose of the average man we meet in the street; 
he had not a long upper lip, one man at least in every five we see has a longer lip 
than George Buchanan had; again he had not well-defined cheek bones, for his 
facial breadth and bizygomatic breadth are just normal. In addition he had not 
a high forehead, his cranial height, whatever estimate we take of it, is simply 
mediocre, and his frontal bone has precisely the shape of the average British 
frontal bone. The skull in no particular corresponds with that of a man of high 
brow, long nose, long upper lip and well-defined cheek bones. 

Now let us see the points in which the owner of the skull was individual. 
First we see from the — 2°75 and — 2°29 that he was individual in the extreme 
shortness of his head. Next we see that the sagittal arc and its components were 
short, and that the horizontal cireumference was small; this of course follows from 
the shortness of the length, but the individuality is by no means so marked. In 
other words the ares have made up for some of the deficiency of the occipital 
region. We may express this by saying that the skull, if small, is “ well-filled.” 
The idea indicated by “ well-filled” is the following: Suppose we made a box of 
the exact length, height and breadth of the skull, and placing in it a bladder, 
blew air into the bladder until it just touched all six sides of the box. That 
would not be a “ well-filled” bladder. Now continue to blow air into it; its three 
principal diameters would be forced to remain the same, but the sagittal, trans- 
verse and horizontal arcs would continue to increase and compensate for the 
smallness of the principal diameters; such a bladder would then be “ well-filled.” 
This is the best interpretation I can place on the term a “ well-filled” skull*. 


* It must be noted, however, that if the capacity be reconstructed from the three principal ares (see 
Biometrika, Vol. 111. p. 370) its value is found to be 1342-5, actually less than that reconstructed from the 


diameters (L, B, OH), i.e.1372°7. The mean of these two values (1358) is very close to the value obtained 
by packing with mustard seed. 
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The next value in excess of unity is that for the dacryal arc. Buchanan’s 
dacryal chord is just the average, but the dacryal arc is short. He must therefore 
have had a flat bridge to his nose. You will remember this type of nose occurs in 
the busts of Socrates. I think we must not on this account rush to the conclusion 
that Buchanan shared the wisdom of Socrates. Indeed modern medicine attri- 
butes such noses to courses of conduct hardly wise. But the point I would have 


you bear in mind is that a picture which has a narrow nasal bridge cannot be a 
true portrait of the owner of this skull. 


Passing down our last column (p. 238) we see that the next value which attracts 
our attention is the size of the orbits ; these, notably the breadth of the right orbit, 
are marked characteristics, especially when we consider that the skull is small. 
We must look out in the portraits for large orbits, especially the right one. 


The next character which attracts our attention is the markedly short palate ; 
but I think we must proceed here with great caution. I have marked the 
measurement with a double query, the palate is too much broken to have a good 
reconstruction, and the skull being short we might assume it to have a short 
palate. I am not certain of this however, for the auricular point is much in the 
average position, and accordingly the position of the basion as reconstructed cannot 
be far out, so that the skull base must be near the normal length, and the palate 
would not necessarily have been so short. The fact is that the chief peculiarity 
of Buchanan’s skull lies entirely in the occipital region, and as there is no profile 
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portrait this peculiarity, and naturally that of the palate, cannot be of service for 
the portraiture. 


We now pass to the length of the foramen magnum. This is again a doubly- 
queried measurement, but I think the foramen was very small; as it is of no service 
for portraiture we may anyhow pass it by. In going further down our column we 
notice some slight individuality in the fundamental triangle, but it is only a “one- 
man-in-six ” individuality and may be entirely due to a small! error in the recon- 
struction of the basion. We again need not stay over this point. 


Next we reach the two cephalic indices, which are naturally in excess, since 
the height and breadth are normal, while the length is very short. A cephalic 
index of 80:5 would be about 83 on the living individual. Out of 3377 living 
British persons I found 249 or 7-4°/ with a cephalic index of 83:5 or upwards. 
This is about 1 in 14 to 15, precisely what I have said before was indicated by a 
figure of 1:5 in the last column of our table. Now a cephalic index of 83 for the 
living head is by no means uncommon in Scotland. Buchanan is said to have 
sprung from Killearn in Stirling, which according to Dr Tocher’s measurements of 
Scottish soldiers is the most long-headed district of Scotland (mean cephalic index 
77:3). His name, however, if we discard the mythical Irishman, suggests a north- 
eastern rather than mid-western descent. But the best of Aberdonians cannot 
claim an average cephalic index of more than 784, in practical accordance with 
that of Britishers south of the Tweed. Whence George Buchanan’s round-headed- 
ness came we cannot say, so ignorant are we of many of the lines of his ancestry. 
It is possible, but by no means needful, to suppose foreign blood*. Unfortunately 
this very marked feature of the skull is of no avail for determining portraiture. 
As far as portraiture is concerned we are left with facial characters only. Most of 
these characters possess no marked individuality, they are close to those of the 
average British skull. But there are certain characters which are individual. We 
should not expect to find a long nose with a narrow bridge, but an average length 
of nose with a flattened bridge. We should expect not a long upper lip but one 
of moderate dimensions. We should not expect emphasised cheek bones, but 
rather a rotundity of face, and this would be surmounted by a hemispherical skull 
cap suggesting no great height of the forehead. There is nothing in the skull 


whatever to suggest a long oval face. George Buchanan was certainly not a “ high 
brow.” 


The skull itself will give you the best idea of these points (see Plates V to IX). 
It is difficult now-a-days to speak of a bullet-headed man, for the shape of a bullet 


* Professor Hume Brown says Buchanan came of the happy mixture of Celtic and Teutonic blood. If 
Celtic origin for Buchanan be assumed because one of his distant ancestors is said to have come from 
Ireland, and he himself could speak Gaelic, we must anthropologically assert that there is no such thing 
as a ** Celtic race,” only a mass of men of very different head shapes, whose ancestors spoke the same 
family of languages. As for ‘‘ Teutonic,” it is again a linguistic and not a racial term. Of a brachycephalic 
German ancestor—to account for Buchanan’s round-headedness and absence of occipital—I can find 
no trace in such pedigrees of the Heriots as exist. If by “ Teutonic,” we are merely to understand 
Lowland Scottish, these are identical with our Londoners and, so far from being essentially Anglo- 
Saxon, have for their nearest congeners the men of the Iron Age in Britain. 


! 

| 

4 


246 On the Skull and Portraits of George Buchanan 


has changed, but we may I think safely say the owner of this skull had an apple- 
shaped rather than a pear-shaped head. 


With this preliminary consideration of the skull we now turn to the portraiture. 
As in the case of many men of distinction there exist several incompatible types 
of portraiture, so for Buchanan there are divergent pictures which undoubtedly 
cannot all represent him. Those who start with the idea that a particular portrait 
is certainly that of Buchanan, citing written evidence for it of a certain grade 


of probability, but then find this portrait does not fit the skull, ought, I hold, 
to do one of two things : 


(i) to consider whether the evidence for the authenticity of the picture is 
more or less reliable than the evidence for the authenticity of the skull ; 


(ii) to consider whether the skull is more appropriate to other portraits which 


have, perhaps, less evidence of their authenticity, but which may after all represent 
the Simon Pure. 


What they certainly must not do is to disregard the skull straight away because 
it does not accord with the portrait they have selected as giving their own ideal of 
George Buchanan. That I take it was the line adopted by Mr William Carruthers, 
who simply dismissed the skull because it was not like one type of presumed 
portrait. Nor must anyone on the other hand, overlooking the fact that craniometry 
is a definite branch of science, assert that the skull possesses attributes which 
actually do not belong to it. A good illustration of this occurs in Dr Aeneas 
Mackay’s article on George Buchanan in the Dictionary of National Biography. 
He writes: 

“Tradition dating from a short period after his [Buchanan’s] death ascribed to 
him the skull preserved in the Anatomy Museum of the University, of which there 
is a print in Irving’s life, and which certainly resembles the best authenticated 
portraits of him which have been preserved, that by Boinard, engraved in Beza’s 
‘Trones’ and of which a copy is in the University of Edinburgh.” 

It is difficult to imagine more errors than are crammed into the last three 
lines of this statement, overlooking the grammatical blunder, by which “the best 
authenticated portraits” is suddenly reduced to the singular as “that by Boinard.” 


Now let us first be quite sure of our dates. George Buchanan died in 1582, 
aged 76. Beza’s volume of Jcones [not “Irones”!] was published in 1580; it 
contains no engraving of Buchanan, and no biography of Buchanan. In 1581 
Beza’s Icones was translated into French by Simon Goulard; there was no portrait 
of Buchanan in it. 

Only in 1673, more than ninety years after Buchanan’s death, Pierre Chouet 
published an anonymous volume at Geneva entitled: Les Portraits des Hommes 
Lllustres, qui ont le plus contribué au Restablissement des belles lettres et de la 
vraye Religion. 

There is no text to this work, beyond the list of portraits, and no explanation 
of its origin. All the blocks of Beza and Goulard are used, and many more besides. 
Among the latter is one of GEoRGE BvcaNnan (see Plate III). It has been too 
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hastily assumed that Beza had a block cut which neither he nor Goulard used, and 
that this was preserved for 90 years, to be published later in the following century 
by Pierre Chouet, also at Geneva. 

That portraits of John Knox and George Buchanan were sent to Beza is well 
known. They were sent by Peter Young, who in a postscript to a letter of Nov. 13, 
1579, addressed to Beza, wrote “Just as I am signing this letter the painter has 
opportunely come in, and brought in a box the likenesses of Buchanan and Knox*.” 

Further, we find in the Scottish Treasury Accounts: “To Adrian Vaensome, 
Fleming, painter, for twa picturis, and send to Theodorus Besa, conforme to ane 
precept, as the samin producit upon compt beris £8. 10s.” 

These pictures must have been painted when Buchanan was 73 years of age. 
The Beza portrait was not therefore, as Dr Mackay states, painted by Boinard (sic), 
but by Vansom+; it did not appear in Beza’s Icones, and there is no reason to 
suppose a copy of that particular portrait is now in the University of Edinburgh. 

But let us goa stage further. In 1598 J. J. Boissard (not Boinard, be it noted !), 
fifteen years after Buchanan’s death, issued his Icones Quinquaginta Virorum 
illustrium Doctrina et Eruditione. This is a work independent of Beza’s Icones, 
and in this is a portrait of George Buchanan (see Plate ILI)—“Georgivs Bvchananvs 
Ata sue 76.” This engraving is by Granthomme; Boissard was the author of 
the book, not the engraver. Now this engraving in Boissard’s Jcones opens a new 
field of inquiry. For while the crude engraving in Chouet of 1673 does appear to 
be the same man as in Boissard, it is not possible to believe that the Boissard 
copied the Chouet. In the first place there is no evidence that a man working 
in Frankfurt in 1598 would have access to an unpublished engraved block of 
Buchanan in Geneva, even supposing the block was at that time in existence. 
Further, and what is more important, the Boissard portrait is clearly a copy of 
the Bronckhorst type of portrait, and it bears the same age, 76}, as the picture 
in the National Portrait Gallery in London. It is not only the age which asso- 
ciates the 1598 engraving with the London picture, but the lobe and arrangements 
of the left ear, cap and locks of hair are alike, and differ from that of the St Andrews 
portrait of this type, which indeed bears the age 75 and not 76 (see Plate X). 
Other pictures of the Bronckhorst type, such as those in the Edinburgh University 
Senate Hall (with scroll in right hand) and at Glasgow University need not be 
considered from this standpoint. One item may, however, be referred to. The 
Granthomme engraving in Boissard has a large fur collar. Such a collar it is 
difficult to discover in the London or St Andrews pictures, but it does appear in 
the second University of Edinburgh portrait—that with a book and hands resting 
on a balustrade. The lobe of the left ear and the hair arrangement are however 


* Hume Brown : John Knox, 1895, Vol. 11. pp. 322—-324. 

+ No trust can be put in the Scottish spelling of foreign names in those days. This name may be 
identical with Vansome, in which form it also occurs, or with that of Vansomer. The best known painter 
of the latter name, as Mr H. M. Hake informs me, was active in the seventeenth century, but had pro- 
genitors of less importance in the sixteenth century. 

t+ Not the age 73 of the Vansom portrait be it noted. 
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very different (cf. Plates III, IV, X, XI and XIII). It is very difficult to believe 
that the Edinburgh balustrade portrait is an independent portrait of Buchanan 
made two or three years before the Bronckhorst—the attitude is so completely 
the same, and the motto upon it is the same*. A copyist might omit the hanging 
lobe of the ear, but he would be extremely unlikely to introduce it de novo. The 
Edinburgh scroll picture, the Glasgow picture—which is almost certainly an early 
copy—and the St Andrews portrait of this type, which is dated a year before the 
London, leave the left ear in obscurity. On this account I am inclined to give 
precedence to the London portrait over all these with its marked individuality 
of left ear. 

Another “balustrade” type of portraiture has come to my knowledge since my 
lecture; it is in the possession of Mr T. D. Findlay of Glasgow. It is difficult to 
say whether the Edinburgh or Glasgow portrait must be considered the original. 
The Glasgow portrait bears the same words “Sic Buchananus ora, etc.,” but has 
no statement of Buchanan’s age. The fact that the book and hands are thrown 
more into shade than in the Senate Hall picture, and a general impression of 
more life and less woodenness, are in favour of Mr Findlay’s picture; but on 
the other hand the details of the hair over the left ear and of the fur collar are 
much more highly elaborated in the Edinburgh picture. I must leave my readers 
to judge for themselves which picture is the earlier. See Plates XI and XI bis. 

Now it appears to be universally agreed by those who have written on the 
portraits that the London and the St Andrews pictures are by Bronckhorst, and 
that the Chouet engraving follows the missing portrait by Vansom sent to Beza, 
but if this be so, how does it come about that the Chouet engraving has for its 
right ear the peculiar lobe, the cap covering a part of the upper ear and the hair 
arrangement of the Bronckhorst left ear? We may suppose one to be a reversal 
of the other, made by the artist drawing directly onto the plate. But why should 
the cap be just covering the upper part of the ear and the curl be practically the 
same in two independent portraits, taken by different artists at an interval of two 
or three years?’ Further the London picture shows Buchanan with a black velvet 
cap, the Boissard-Granthomme engraving has this velvet, and the crumpling of 
the velvet is indicated by a series of “light” lines or patches descending to the 
forehead. These light effects are surely misinterpreted in the Chouet, and instead 
of a black velvet cap, when the wood-engraver sets to work, we have ruled lines, 
indicating a pattern on the cap, making it look like a printed cotton night-cap 
rather than the usual scholar’s head-dress. No such cotton night-cap appears any- 
where else in Beza’s [cones or Choue%’s engravings (see Plate III). I suggest that 
there is strong evidence that the Vansom painting of Buchanan never reached 
Beza or, if it did, he never had it engraved, but that Chouet, when 90 years later 

* Mr William Carruthers (George Buchanan, Glasgow Quatercentenary Studies, 1906, Maclehose and 
Sons, 1907, p. 359) associated this portrait closely with the London one, without saying that it is 
acopy. He says that on the upper right corner is written “Ann. Aet. 76,” i.e. the same date as the 
London picture, but my photograph of it has quite clearly, not on the upper right, but the upper 


left corner, ‘‘ Aetatis 73,” which if we could trust it, would make this picture three years older than the 
London picture and the subject’s age the same as that of the Vansom picture. 
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he reproduced Beza’s Icones with 40 new portraits, got a wood-cutter to copy in 
crude outline without reversing Granthomme’s engraving of 1598. It is easy 
therefore to explain how “one man” has been said to be represented by the 
Vansom-Beza and the Bronckhorst-Boissard engravings. We do not get proof of 
the truthfulness of the Bronckhorst portrait if, as I think must be held, the 
Chouet is merely a crude copy of it. What also becomes of Sir William Hamilton’s 
statement that the Chouet must represent the real man because it fits the skull 
better? It fits the skull better because the crude wood-cutter has altered the 
proportions of height to breadth of face. He has given the pear-shaped London 
face a more apple-shaped form, while preserving the disproportionately long nose, 
and (for the skull) impossible nasal bridge. 

Those who hold that the Chouet woodcut was really prepared from the Vansom 
painting of 1579, when Buchanan was 73—and when considering this they must 
remember the Edinburgh balustrade picture, which is given as of age 73—must 
give up the idea that the London National Gallery picture is the Bronckhorst 
painting, notwithstanding its date, and suppose it also to belong to the Vansom 
group. I think they will find it easier to suppose that Chouet copied from Boissard. 

We must now ask what is. the evidence that associates the London and 
St Andrews portraits with Bronckhorst, and makes the Edinburgh and Glasgow 
pictures of this ancestry. 

The evidence consists in Bronckhorst’s bill to James VI, and his majesty’s 
order for its payment, dated September 1580. The account runs: “Certane 
portraitures maid by me at his maiesties commaund and delyuerit laitlie to his 
hienes quherof I have resauit as yit na payement.” The first is a portrait of 
“his maiestie” from the belt, and the price “xvj lib.” The second item runs: “Ane 
other pourtraiet of Mr george buchanane. viii lib.” (only half as much for a 
subject’s portrait as for a king’s!) and the third item a full length portrait of 
“his maiestie ” xl lib., which shows the value of the royal legs. 

The Buchanan item “Ane other pourtraiet”” may mean that the first portrait 
was that of the King, but it might also mean that Bronckhorst had painted a 
previous portrait of Buchanan. Arnold van Brounckhurst* must have painted his 
picture or pictures before September 1580. The St Andrews portrait has on it 
the date 1580, and the age 75; the London portrait is dated 1581, with the age 76. 
Both portraits are clearly closely related to each other, the St Andrews being in a 
much worse state of preservation. 

In 1580 James was only a boy 14 years of age, and he may or may not have 
wished for a portrait of his tutor. The order for payment is countersigned by 
Angus and Argyle, who then dominated the situation and may themselves have 
desired the portrait of Buchanan. What James thought of Buchanan and his 
works in his later years is well known, and in the lists of paintings belonging 
to Charles I prepared after his execution, no portrait of Buchanan appears in royal 
ownership. 

* Thus the name seems to be spelt in the Scottish records. This artist was probably related to the 
better known and better spelt Bronckhorst of the seventeenth century. 

Biometrika xvi 17 
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Now without denying that the whole group of portraits to which we are 
referring may have originated in the Bronckhorst painting, there is still some- 
thing obscure about them. In the first place there is the inscription on three of 
them, which runs: “Sic Bvchananvs ora, sic vvltvm tvlit. Pete scripta et astra, 
nosse si mentem cvpis.” 


Mr Carruthers translates this as follows: “This is the face and these are the 
features of Buchanan. If you want to know his mind, search his writings and 
the stars.” 


Now I have forgotten the little Latin I ever knew, but I should translate the 
sentences: “Thus Buchanan bore his features, thus his face. If you seek his 
spirit, search his writings and heaven.” 


In other words the verb is in the perfect and strongly suggests that Buchanan 
was dead at the time the writing was affixed to the picture. His “ mens,” the 
“divina mens,” is to be found only in his writings and heaven. Further, Buchanan 
died on September 28, 1582, and is said to have been born in February 1506, or 
if that be Old Style, February 1507; he could therefore have only been 75 or at 
most 76 in the year of his death, 1582, while the London portrait makes him 76 
in 1581 and the St Andrews portrait 75 in 1580. Thus the dates on these pictures 
appear to be erroneous, and I believe the dates and inscriptions must have been 
put on, and probably the pictures painted, after Buchanan’s death. They may all 
have been copied from an original Bronckhorst which has not survived to our 
day. If they are copies and not only copies from the Bronckhorst, but one from 
another, we can easily explain the growth of idealised features and the emphasis of 
an intellectual spirituality, which may have been far less intensely present in 
the actual man himself. 


In connection with this type of portrait two engravings occur, copies of which 
are in the Print Room of the British Museum (see Plate XII). The first of these 
is closely related to the Edinburgh “balustrade” picture. The portrait has an 
ornamental border by Karel van Sichem, and it is said to be painted by an artist 
whose monogram, there given, has not been identified. The engraving itself is 
by a second van Sichem, possibly Christoph, the son of the former. The original 
ornamental border enclosed an oval portrait which has been cleaned off and 
replaced by a square frame containing Buchanan. It has been suggested that the 
print was prepared in Cologne early in the 17th century. One would suspect 
that it was issued as a frontispiece to some edition of one or other of Buchanan’s 
works, published in Cologne or the Netherlands, but I have not hitherto been 
able to identify it. It has the usual lines “ Sic Buchananus ora, etc.” A second 
print (see also Plate XII), obviously a copy of the van Sichem, was drawn by 
Elias Boschius (van der Bosche?) and engraved by Johan Bussemecher. This is 
still the Bronckhorst type, but a very inferior copy of the van Sichem. If the 
statement on the van Sichem plate be correct, the Edinburgh balustrade 
portrait, however much it resembles the Bronckhorst type, was not itself painted 
by Bronckhorst, but by the hitherto undetermined artist of the monogram. 
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We will shortly determine to what extent the skull fits the Bronckhorst 
group of portraits. Meanwhile we ask what other portraits or pseudo-portraits 
are available. A very useful graphic index to the various types of portraiture of 
Buchanan was provided by Mr Maitland Anderson in his pamphlet issued for the 
St Andrews Quatercentenary*, which he has kindly permitted me to reproduce. 
It shows four pictures of the Bronckhorst type, two of the Buchan type, and two 
of the Povey type. These will be found useful for showing at a glance the 
different types of portraiture with which we have to deal (see Plate XIV). 


In the first place there is the Earl of Buchan’s so-called “Titian” and its repro- 
ductions by Raeburn and others. This picture is at St Andrews, and is not to be 
confused with the St Andrews picture of the Bronckhorst type. The person 
represented is not the Buchanan of any other type of portraiture ; the face does not 
fit the skull, and there is little doubt that it is a copy of the portrait of President 
Jeannin of Dijon which hangs in the Musée of that city (see Plate XV). We can 
therefore at once dismiss the “Titian” type as no portrait of Buchanan at all. 


The next type requires far more consideration. We are certain of at least three 
portraits of this character, which I will call the “ Royal Society or Povey type.” 
Thomas Povey was a member of the Long Parliament in 1647, so it is reasonable 
to suppose that he was born between 1610 and 1620, i.e. from 30 to 40 years 
after the death of Buchanan. He was a friend of Pepys and an original member 
of the Royal Society from 1663. To that Society he presented a portrait of George 
Buchanan, which has painted on it: Georgivs Bvchanan Scotvs (see Plate XVI). 

The costume is that of Buchanan’s date. The face is wholly unlike that of the 
Bronckhorst type. Now the portrait has been some 260 years in the possession 
of the Royal Society, and therefore the ascription to Buchanan is not a recent one. 
It carries us back to a date, when probably some of those living had seen Buchanan. 
Povey was a favourite at Court, and Charles II, the grandson of Buchanan’s pupil, 
who frequented the meetings of the Royal Society, could hardly fail to have seen 
portraits of his grandfather’s tutor. Mr Carruthers, who dismisses this portrait, 
says that George Buchanan was a common name in Scotland; that two George 
Buchanans might have had their portraits painted, and that the inscription only 
says “George Buchanan, a Scot.” I think it would be more reasonable to translate 
it: “George Buchanan, the Scot,” and I think there was only one man of that 
name who reached fame at that date. Now it is quite clear that the word Scotus 
is significant. If this George Buchanan is not the real man, but some rich merchant 
who visited the Netherlands and had his portrait painted there, as Mr Carruthers 
suggests, why was Scotus put on the picture? It was hardly needful to identify 
him in the family portrait gallery! Is it not probable that this portrait, whoever 
it may be of, was being sent abroad—to England, to Denmark, or even to Geneva,— 
in which case the Scotus was appropriate ? 

Further, in all the editions of Buchanan’s works issued in his life, the author on 
the title page appears as “Georgius Buchanan Scotus,” which is certainly “George 


* The Writings and Portraits of George Buchanan, University Press, St Andrews, 1906. 
17—2 
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Buchanan, the Scot.” But Povey’s George Buchanan does not stand as a sole 
representative of this type. Dr Richard Mead, the celebrated physician, born in 
1673 (d. 1754) was a man of wide knowledge, who had studied under Graevius in 
Utrecht and Boerhaave in Leyden. He had travelled in Italy and in other parts 
of Europe and amassed a great library, a gallery of well-chosen pictures and a 
collection of coins and other antiquities. His library alone contained 10,000 volumes 
and his pictures, books, etc. sold for more than £16,000 on his death—a very large 
sum for those days. He built for his pictures and antiquities a gallery attached to 
his house in Great Ormond Street, London. Among his pictures was a portrait of 
George Buchanan. This painting is almost certainly the one now in possession of 
the Duke of Sutherland. It was engraved by Houbraken in 1741 and the picture 
was then attributed to Porbus (see Plates XVII and XVIII). I do not think this 
portrait was a copy made for Dr Mead of the Royal Society picture, but it is un- 
doubtedly of the same man and must, judging from the hair, have been taken at a 
somewhat later time. Why should the supposititious rich merchant of Edinburgh 
have had no less than three paintings of himself? How did two of them come to 
London to fall into the hands of two fellows of the Royal Society ?—I think these 
portraits tend mutually to strengthen each other, and to give the Royal Society 
type a strong claim to be considered when we compare skull and portraits. 

Still further confirmation of the originality of this type of portrait has turned 
up since this lecture was given in Edinburgh, namely a picture in the possession of 
Mr George K. Sowersby, which is also a portrait of early date with the same ex- 
pression of features but without any title (see Plate XIX). Mr Carruthers’ worthy 
merchant of Edinburgh must indeed have been fond of having paintings made of 
himself. It is difficult to place exactly Mr Sowersby’s portrait. It is certainly not 
a copy of the Royal Society portrait, being much closer to the Dunrobin picture. 
There is no doubt that it is the same man, but it would be unwise to assert at 
present which is the original. 

Like Dr Mead’s portrait the Royal Society’s has been attributed to Porbus, 
a Flemish artist, but more recently to Adrian Keij. It is clearly Flemish work. 
There is no evidence that Porbus, either father or son, came to Scotland, and little 
certain evidence that Buchanan visited the Netherlands. But the difficulty about 
a Flemish painter is not really great, as Flemish painters were by no means 
unknown in Edinburgh. Now we know that Tycho Brahé in his Observatory at 
the Uranienberg fitted up for him by Fredrik IT of Denmark had a portrait of 
George Buchanan. Brahé and Buchanan had been in correspondence in 1576. 
Tycho Brahé travelled in Germany and Switzerland in 1575, and possibly then 
made the acquaintance of Beza and through him of Buchanan. It must also be 
remembered that from 1560 onwards Buchanan had been busy with his poem 
De Sphera, which takes the anti-Copernican view—what we might in modern 
language term the “Fundamentalist” view—of the universe. This was also 
Brahé’s standpoint, and Buchanan may well have sought his advice and _inter- 
pretation against the Copernican views of the more advanced school of heretics. 
Buchanan’s pupil James VI recognised this portrait of his old tutor, when on going 
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to fetch his wife Anne of Denmark he visited Tycho Brahé at the Uranienberg. 
Brahé left Denmark in 1597 and died in Prag in 1601. Buchanan’s portrait has 
been sought in vain in Denmark and Bohemia. If it was left at the Uranienberg 
it might well pass back to the Danish Royal Family with the observatory, and 
may have been sent to Queen Anne as of more interest to her husband than to the 
Danes. But what has this to do with Povey? Just this, Povey was son of Justinian 
Povey who was treasurer to Anne of Denmark, that frivolous but kindly-hearted 
queen, who never could keep her expenditure within her income, and was con- 
tinually pledging her jewels and valuables even to her treasurer. If Buchanan’s 
portrait came from the Uranienberg to Queen Anne it may very possibly have passed 
to her treasurer Justinian Povey, and thence to his son Thomas who ultimately 
gave it to the Royal Society*. All these stages are only conjectures, but not very 
improbable conjectures. It has been suggested that Peter Young, who was in 
Denmark in 1585 and 1589 about the marriage of James to Anne, presented a 
portrait of Buchanan to Tycho Brahé, but it is not very clear why he should have 
done so either 3 or 7 years after Buchanan’s death. It seems more reasonable to 
suppose that Buchanan himself sent his portrait to Brahé. The picture, while it 


may represent a man well on in the sixties, is more appropriate to that age than 
to the seventies. 


Whence Dr Mead obtained his portrait it is difficult to suggest; if as seems 
probable, it is the one now in the possession of the Duke of Sutherland at Dunrobin 
Castle, then the picture is of the same man as the Royal Society portrait, but as 
I have said he is several years older, for the hair of the head, moustache and beard 
are all grey. Is it conceivable that Dr Mead’s portrait was the Vansom sent to 
Beza? If that were so, then the Tycho Brahé and the Beza portraits need no 
longer be treated as missing, and we should have a second type of Buchanan very 
different from the Bronckhorst type. 


Mr Carruthers speaks of the Povey-Mead type of Buchanan portrait as if the 
tradition of this portrait type had started with some error of assignment by Povey 
in the last quarter of the 17th century, and that from Povey’s error had arisen 
the whole tradition of this portrait type representing Buchanan. Mr Carruthers 
seems to have entirely overlooked the facts that: (i) the Povey and Mead portraits 
represent the same man at different ages and (ii)—what is far more important— 
the so-called Povey type is the type which appears in the Elzevir edition of the 
Poemata in 1628, which is obviously engraved from a picture of the Povey type, 
if not from the Povey picture itself. In 1628—James I only died in 1625—there 
, must have been persons still living who had known Buchanan in the flesh, yet 
| this passed as Buchanan’s likeness not only in this edition of his poems, but in 
those issued at Amsterdam in 1676 and 1687, until it appears in a highly idealised 
form in Ruddiman’s great folio edition of the Opera Omnia published in Edinburgh 


* An absurd statement occurs on p. 534 of the Glasgow Quatercentenary Studies (George Buchanan), 
1906, where we are told Povey presented the picture to the Royal Society in 1633, i.e. about 30 years 
before the foundation of that Society and when Povey was a very young man just beginning to study 
for the Bar! He lived on into James II’s reign, and probably died towards the end of the century. 
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in 1715 (see Plates XX and XXI). In other words the Royal Society type was as 
well known and as often—perhaps more often—published in the century which 
followed Buchanan’s death as the Bronckhorst type. Despite Mr Carruthers, 
traditional evidence for the Povey type is as strong as for the Boissard-Bronckhorst 
type. It is not until Burmann’s re-issue of Ruddiman in 1725 that the Boissard- 
Bronckhorst type replaces the Povey type in the Opera (see Plate XXII). 


It has been said that there was no opportunity for a painting being made of 
Buchanan when he was about 60 years of age. This overlooks the fact that in 
1565—6 Buchanan visited Paris to see his Latin paraphrase of the Psalms passed 
through the press by Henri Estienne and his brother *. On the title page of that 
work Buchanan appears as Georgius Buchananus Scotust+ “poetarum nostri saeculi 
facilé princeps.” A man with such a reputation might well have his portrait 
painted. How Buchanan travelled to Paris we do not know, very possibly through 
the Netherlands, and there is a remarkable letter of Languet to Buchanan stating 
that he lives in sight of Rotterdam which reminds him not only of Erasmus but 
also of Buchanan. This suggests that he had met Buchanan in Rotterdam. 


While the Bronckhorst and Royal Society types show Buchanan as “aged” 
if not “agéd,” there exist two other portrait types quite incompatible with the 
Bronckhorst type (but more or less compatible with the Royal Society type) which 
represent, or profess to represent, Buchanan aged about 30 and 50 respectively. One 
of the former type was originally at Hamilton Palace, and from it Harding made 
an engraving for Pinkerton’s “Scottish Gallery” (see Plate XXIII). By courtesy 
of the Duke of Hamilton, I have been informed: (1) by the Hamilton Estates 
Trustees that no such portrait now exists in Hamilton possession, and (2) by 
Messrs Christie, Manson & Woods, that no such portrait was sold with the 
Hamilton Palace collections. A portrait of this type on canvas has, however, 
turned up in the possession of Mr H. B. Buchanan, which is undoubtedly an old 
painting. It has been in the possession of his family for many years and may 
possibly have reached them by purchase or gift from Hamilton Palace. I am very 
grateful for the permission to reproduce it in this paper. A second, somewhat 
idealised, version of this portrait is now in the Technical College at Glasgow, 
which again may be the original Hamilton Palace picture (see Plates X XIII, Right, 
XXIV and XXV). 

One of the second type of portrait—i.e. of Buchanan about 50 years of age— 
is in the possession of Miss Christie of Bedlay. By her permission and that of 
the publishers of the Glasgow Quatercentenary Studies (George Buchanan), I have 
been allowed to reproduce it (see Plate XXIII, Left). 


While the skull of an adult changes to some slight extent with age the amount 
is so small as to be quite insignificant compared with the personal equation of an 


* Ruddiman in his Anticrisis, 1754, writes : ‘‘I have heard it related a hundred times that Buchanan 
when Principal of St Leonard’s College, at St Andrews, did make such a voyage to France.” 

+ It must be emphasised that the word Scotus almost invariably follows George Buchanan’s name 
in any of his works published in his lifetime, and is thus in keeping with the title on the Royal 
Society portrait. 
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artist. His deviations from truth, even in the case of the most conscientious of 
the craft, are of quite a different order of magnitude. We can have no hesitation 


then in applying the Edinburgh skull to the portraits reputed to be those of 
Buchanan in the years from 30 to 50. 


I will now run rapidly through the portraits. I have to thank Miss Ida 
McLearn for measured drawings of the portraits to each of which we have attached 
outlines of photographs of the skull in the corresponding attitude. 


(1) Portrait of a Young Man. Glasgow Technical College Portrait. 


With a slight allowance for artistic inaccuracy the skull might almost fit this 
portrait. We cannot, however, raise the alveolar margin adequately without 
getting the orbits and nasal bridge out of position. The frontals of the skull are 
scarcely high enough for the forehead of the portrait (see Plate XX VI, Left). 


Allowing for artistic inaccuracies I do not think we must venture to say on the 
authority of the skull that this type of portrait does not represent Buchanan when 
young. 


(2) Miss Christie’s Portrait of a Man between 40 and 50. 


Again this is not such a very bad fit. Forehead too high and broad. Cheek 
bones about right and mastoid fairly good. Alveolar border too low and cannot be 
raised without sending orbits out of position (see Plate XX VI, Right). 

It cannot be affirmed on the evidence of the skull that this is certainly not 
Buchanan. 


(3) Chouet (?Vansom). Right (? left reversed) mastoid out of position, left cheek 
bone projects beyond portrait without flesh. Alveolar margin is under nose instead 
of under upper lip. Excess of right frontal. Would be a very bad portrait of the 
owner of the skull, if portrait at all (see Plate XX VII, Left). 


(4) George Buchanan from Boissard’s Icones, 1598. Left mastoid out of position, 
right cheek bone protruding beyond portrait without flesh. Alveolar margin of 
upper jaw under nose instead of under upper lip. Excess of right frontal. All the 
defects of the Chouet portrait, of course reversed (see Plate XX VII, Right). 


(5) George Buchanan, National Portrait Gallery. This may stand as repre- 
sentative of the two Edinburgh, the St Andrews, the two Aberdeen and the Glasgow 
portraits. They are all of the same, the Bronckhorst, type (see Plate XX VIII). 


Left mastoid out of place, nose much too long, so that it reaches to alveolar 
margin. Right cheek bone without flesh outside limits of portrait. Right frontal 
in excess. It is impossible to defend this portrait as a truthful portrait of the 
owner of the skull. Those who accept this portrait as a representation of George 
Buchanan must either discard the skull, or be willing to admit and approve extreme 
artistic licence. 


(6) Earl of Buchanan’s “ Titian,” St Andrews (Copies at the Buchan Club, etc.). 
I do not know whether it is worth the energy it has taken to dismiss this portrait 
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of President Jeannin, but I have done so. The orbits are out of position, the alveolar 
margin absurd, and the forehead ridiculous for the skull (see Plate XXIX, Left). 


(7) The Royal Society's Portrait. The orbits are reasonable, so is the nose. 
The alveolar margin is in the true position; the cheek bones practically where 
they should be, and the frontals excellent. If the skull is to be our guide, this 
could hardly be bettered (see Plate XXX). On Plate XXIX, Right, will be found 
another illustration of this type, namely Mr Sowersby’s portrait fitted with the 
skull. As in the case of the Royal Society picture the skull fits well. 


We are, I think, driven to the conclusion that the skull accords with the Povey- 


Mead type of portrait, the type of which our earliest trace of authenticated date 
is the Elzevir Poemata of 1628. 


Now what exactly does this mean for us as phrenologists ? Let us look on the 
two portraits once more. 


Let us take the highly idealised Bronckhorst portrait in Edinburgh as one 
type (see Plate XIII). Note the high forehead of great breadth, the marked 
bridge and long nose, the projecting cheek bones and stern look. We have here 
all the signs of the earnest religious reformer, the ascetic scholar and the stringent 
moralist ; Buchanan as a second Cato. 


Now look at the Royal Society picture (see Plate XVI). We see the exaggerated 
right orbit, the Socratic nose, and all the rotundity which identify man and skull. 
The portrait suggests the intimate of Muretus, the man who preferred a pawky 
story to true history, the man who could write poems worthy of Catullus; the 
man who could appeal alike to the surpassing talents of Henry VIII and of the 
Regent Moray, or sing the virtues of Mary Stewart and of Elizabeth Tudor, accord- 
ing as the moment was propitious for receiving their aid. 'The two portraits express 
the two attitudes that historians have taken to Buchanan, and whether we like it 
or not, each one of us has got to take his choice between Cato and Catullus. For 
as Buchanan’s friend Muretus laid it down: 


“ Quisquis versibus exprimit Catullum, 
Raro moribus exprimit Catonem.” 


The man who in his verses propounds Catullus, will scarcely exemplify Cato in 
his morality. 

Here is a choice between alternatives*. Either you must reject the Cato 
portrait as no truthful likeness, or if you accept the Cato portrait as a truthful 
representation, you must return the skull to the Greyfriars as not that of George 
Buchanan, although it fits excellently that other portrait of “Georgivs Bvchanan 
Scotvs.” The alternative lies in how you translate “Scotvs.” 


* The Chairman, Sir E, A. Schiffer, suggested at the end of the lecture that the two portraiture 
alternatives were those of Don Quixote and Sancho Panza. This somewhat apt description of the 
Edinburgh “ Scroll’? and the Royal Society pictures has really no application to the alternative interpre- 
tations of the indisputable mental ability of George Buchanan. 
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The Granthomme Engraving from the Boissard, 1598. 


The Chouet Woodcut, 1673. 
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Biometrika, Vol. XVIII, Parts III and IV Plate IV 


Pearson, Shull and Portraits of George Buchanan 


Bronckhorst type, from the National Portrait Gallery, London. Asserted date, 1581. 
| Reproduced by permission. | 
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Biometrika, Vol. XVIII, Parts III and IV Plate IV (dis) 


Pearson, Skull and Portraits of George Buchanan sf 


VCHANANYS 


Bronckhorst type, from the Landesbibliothek, Wolfenbiittel. A copy made 14 years after 2. 
Buchanan’s death either from the National Portrait Gallery (London) painting, or more 
probably from a missing picture from which the latter also was copied. [By kind permission. | 
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Pearson, Sku/l and Portraits of George Buchanan 
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Buchanan’s Skull; Norma Facialis. 
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Pearson, Skull and Portraits of George Buchanan 


Buchanan’s Skull; Norma Verticalis. 


Plate VI 
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Biometrika, Vol. XVIII, Parts III and IV Plate VII 
Pearson, Skull and Portraits of George Buchanan 


Norma Lateralis (Right Profile). 


Buchanan’s Skull 
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Pearson, Skull and Portraits of George Buchanan 


Norma Lateralis (Left Profile). 


Buchanan’s Skull 
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Pearson, Skull and Portraits of George Buchanan 


Buchanan’s Skull; Norma Occipitalis. 


Plate IX 


| 
| 
| 
| 
— 
— 
| 
‘ 
aad 


Biometrika, Vol. XVIII, Parts III and IV Plate X 


Pearson, Shell and Portraits of George Buchanan 
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Pearson, Skull and Portraits of George Buchanan 


‘ Balustrade ” Portrait, from the Senate Hall, Edinburgh University. Asserted date, 1579. 
Clearly Bronckhorst type. | By kind permission. | 
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Pearson, Skull and Portraits of George Buchanan 
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Portrait of George Buchanan of the Edinburgh ‘‘ Balustrade ” type in the possession 
of Mr T. D. Findlay of Glasgow. [By kind permission. | 
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Pearson, Shull and Portraits of George Buchanan 
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Biometrika, Vol. XVIII, Parts III and IV Plate XIII 


Pearson, Shull and Portraits of George Buchanan 


Highly idealised portrait of the Bronckhorst type, in the Senate Hall, Edinburgh 
University. Note the disappearance of the lobe of the left ear. Probably a rather 
late copy, [By kind permission. | 
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Pearson, Skull and Portraits of George Buchanan 
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Pearson, Shell and Portraits of George Buchanan 
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Pearson, Skull and Portraits of George Buchanan 


The Royal Society or ‘‘ Poemata” type of Buchanan portrait, 
originally attributed to Porbus, possibly by Adrian Keij. 
[By kind permission. | 
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Biometrika, Vol. XVIII, Parts III and IV Plate XVII 
Pearson, Skull and Portraits of George Buchanan 


The Dunrobin Portrait of Buchanan of the ‘ Poemata” type, formerly in the possession 
of Dr Mead, and now in the possession of the Duke of Sutherland. 
| By kind permission. | 
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Pearson, Skull and Portraits of George Buchanan 


Plate XVIII 


The Houbraken engraving (reduced) of Dr Mead’s portrait which identifies 
the latter with the picture now at Dunrobin Castle, 
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Biometrika, Vol. XVIII, Parts III and IV Plate XIX 


Pearson, Skull and Portraits of George Buchanan 


Mr Sowersby’s Portrait of Buchanan of the “Poemata” type, 
said to have been formerly in the possession of the Dowager 
Countess of Seafield. [By kind permission. | 
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Pearson, Skell and Portraits of George Buchanan 
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Biometrika, Vol. XVIII, Parts III and IV Plate XXI 


Pearson, Sku// and Portraits of George Buchanan 


cs, apid RobertamFnbarnum TypcgraphumRegiun. Dom 


Title-page of the first Edinburgh Edition of the Opera Omnia, by Ruddiman 
in 1715, showing a highly idealised medallion of Buchanan of the “Poemata” 
type of portraiture. [Reproduced, much reduced, from a copy in the possession 
of Dr James Bonar. } 
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Pearson, Skull and Portraits of George Buchanan 
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Re-introduction of the Bronckhorst type in the re-issue by e 
Burmann of the Opera Omnia, 1725, [Reduced from quarto . 
format. | 
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Pearson, Skill and Portraits of George Buchanan 
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Pearson, Skull and Portraits of George Buchanan 
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Biometrika, Vol. XVIII, Parts III and IV Plate XXV 
Pearson, Skull and Portraits of George Buchanan 


Portrait of the youthful Buchanan in the Technical College, Glasgow. Probably : 
a copy of that originally at Hamilton Palace. [By kind permission. | 
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Pearson, Skull and Portraits of George Buchanan 


Plate XXVI 


George Buchanan 
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Pearson, Skull and Portraits of George Buchanan 


Plate XXVII 


George Buchanan 


SUPPOSED VAN sont 


George Puchanan 


Chouet Woodcut. 
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Pearson, Skull and Portraits of George Buchanan 
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FURTHER NOTES ON NON-LINEAR REGRESSION. 
By J. NEYMAN, Pa.D., Warsaw. 


1. THE theory of non-linear regression has been discussed by Prof. K. Pearson 
in his memoir On the General Theory of Skew Correlation and Non-Linear Re- 
gression (London, 1905), and in a paper in Biometrika, Vol. x11. pp. 296—300. 
In these two papers the general method of calculation of polynomials, which are 
approximations to the unknown skew regression line, is given and formulae are 
deduced expressing the coefficients of such polynomials as far as the fourth order 
in terms of the frequency constants of the population considered. 


The results of Heine* combined with those of H. Hamburger+, who worked 
with the theory of continued fractions, allow us to put the results of K. Pearson 
into a more genera] form. 


We shall do it, however, without any appeal to the theory of continued 
fractions. 


In a further paper we shall apply these results to the correlation between the 
standard deviation squared and the mean in a sample drawn from an indefinitely 
large population. 


Higher Parabolae of Regression and Higher Correlation. 


2. Let # and y be two variates, and z = f(a, y) the function representing their 
frequency distribution. The two variates may have their possible values forming 
a continuous set of points (as it is, e.g., in the case when their distribution is 
normal), or not (as it is in the case when the distribution is, e.g., binomial). We 


shall use nevertheless the notation of . which in the last case is to be considered 
as the integral of Stieltjes, i.e. as a sum, 

The integrals over the entire range of each of the two variates we shall denote 
by | ae as in the case when the range is limited, it is always possible to suppose 


that outside this: range 


Further, we shall suppose that the integral of f(#, y), taken over the whole 
of the ranges of # and y, is equal to unity. 


* Handbuch der Kugelfunctionen, 1878—1881, 1. p. 292. 
+ ‘*Ueber die Erweiterung des Stieltjesschen Momentenproblems,” Math. Annalen, Bd. Lxxxt. 
Heft 2—-4, 1920. 
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258 Further Notes on Non-Linear Regression 


Let 7, be the mean value of y corresponding to a given value of « We have 


+o 
+00 
In trying to approximate to the generally unknown function ee) we shall 
calculate a polynomial of the n-th degree 
satisfying the condition, that the integral 


isa minimum. If we observe, that p(«) dw is nothing but the probability that the 
variate « should lie in the subrange da, the meaning of this integral is clear. 

As / is always a differentiable function of the a’s, we use the ordinary method 
for finding minima, and solve the system of n +1 linear equations in the a’s, given 


by equating to zero the consecutive derivates = (k= 0,1, 2... n), namely, 
k 


where py, = 3 | (x) da = | | yx f (x, y) dady...(8). 


If the origin of the coordinates be at the mean of the two variates, we have 
=2%»)=0. We notice also that 1. 


As the determinant of the system of equations (7) 


| Mo, Mk Bn 

Pa, vee Betis Mnti | 

Pk+1> + Pek» | 


is always >0*, system (9) gives solutions for the a’s which are not all equal to 
zero when, and only when, the d,’s are not all equal to zero. Such zero solutions 
have, however, no interest for us. Considering the system of n +2 equations 


—YH+A + + +...+ a,2"=0 
and — + + + + = 0 ...(10), 
(k=0, 1, 2...n) 


* T. J. Stieltjes, “Recherches sur les fractions continues,” Ann, de la fac, des sc. de Toulouse, 1894. 
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we can free ourselves of the a’s and write the necessary and sufficient conditions 
that they should be satisfied as 


—%, fis Bk» Bn 

—™, Pa, Mas 

= +++ Mntk> +++ Mon 


which is really the expression of the n-th approximating parabola 


required. It is clear, that the sign of the terms in the first column can be altered. 

We can now proceed to the expression of the n-th correlation ratio. 

With these words we denote an expression, which is connected with the n-th 
parabola of regression (12), in the same manner as the coefficient of correlation is 
connected with the same parabola for n =1 (i.e. with the straight line fitted to the 
unknown correlation line) and as the well-known correlation ratio » is connected 
with the actual regression line. 

In this way we shall get a measure of the closeness of points (#, y), representing 
individuals of the population considered, to our n-th parabola of regression. 

Such a “natural” measure of closeness is represented by the very familiar integral 


+a fto 


in which P, means the polynomial, giving the n-th parabola of regression. If we 
suppose for a moment that the coefficients of P, in (13) are variable, the integral 
(13), which we may now denote by =,,”, will be a function of them, and it is easy 
to see that if we try to dispose of these coefficients to make >,’? a minimum 
we shall find the same equations (7) to determine them. Therefore >,? is the 
minimum value of =, and we shall call it the mean square deviation of individual 
observations from the n-th regression-parabola, or shortly, the n-th mean square 
deviation. To get its expression in terms of y’s and 2’s, we consider the following 
function of the a’s and a new variate b, 


+o 
| | [by = Pal f(y) dedy (14). 


It is clear that &)° is a homogeneous polynomial of the second degree in the 
a’s and 6. Therefore, using the Euler formula, 


s = (b) 


for all values of the a’s and b. But putting b=1, we have S»*=*,’, and as the 
a’s satisfy the conditions (7), ie. as all 
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260 Further Notes on Non-Linear Regression 
the equation (15) becomes, for b = 1, 
\2 +o 
2,7 = = 2| | y (y — Pa) f (a, y) (17) 
Using in the ordinary notation o, for the standard deviation of y, or putting 
and dividing the two parts of (17) by 2, we have 


Comparing this expression with (12), we see that to get =)? we have only to 
put into (11) o,? instead of y and d, instead of #* (k =0, 1, 2... mn), and to divide 
by the coefficient of 7, i.e. by —A,. In this way we have 


No, Fo> Mis 
| An, Pnti «++ Mon 
where 
0, Ad, Ax, An 
Fy An| — Xx, 


— Bn, +++ Mntk>s «++ Bon | 


which is the expression of the n-th correlation ratio we wanted to find. It is easy 
to see that for n =1, we have 


which is the expression of the ordinary coefficient of correlation squared, 7,,7*. We 


see further, as the equation of the n-th regression parabola (11) can be also written 
in the following manner, 


0, a, a, ak, a” 
— Fo» Bey Mky Mn 
y= — =—~ (say)...(24), 
| — Metis Mek, An An 
| Mantis Pntks «++ Pon 


* As H,? is nothing but the most natural generalization of the coefficient of correlation squared, it 
would be perhaps better to call it “the n-th coefficient of correlation.” Unfortunately this name is 
adapted for what I should call ‘‘ the coefficient of correlation between n-th powers of variates.” 
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that, having the equation of the n-th regression-parabola, we get at once the expression 


of H,2 by putting H,; instead of ¥ and Me instead of the k-th power of «(k=0,1...). 
It is necessary to remember that « and y are measured from their means. 


Expansions of P, and 
3. We shall find it of interest to calculate the general expression for the 
difference 
As P, = 0, we shall be able to express the right-hand side of the equation of the 
n-th regression-parabola in the form of a sum ; 
In this way we shall reach the expressions for successive regression-parabolas, 
without any appeal to the theory of continued fractions and of orthogonal functions. 


For n<4 these expressions have been given by Prof. K. Pearson in Biometrika, 
loc. cit. 


We shall need here the following property of determinants. Let D be a deter- 
minant and D,. ¢\i;,. the determinant which we obtain from D by cancelling the 
r-th, s-th...¢-th rows and the same number of columns, namely the 7-th, j-th... k-th 
columns. 

We have Dx = Daum Dim 
where m > 1. 


If Dy, and Diya are not equal to zero, we can write 
(28), 
Diy, Dray Drain 
| the identity which we shall apply to our expressions of H,? and P,. 


Turning back to (25) and putting A’ instead of A,’, we can now write 


A’ 
| and, using (28), 
Q 1 Be, Bn | Mo> fi, Be, 


or changing the columns of the first 
sign of the X’s: 


Q 1 Fis Bn | Ko> pas 
| 


| 
| 


| 

Be determinant into rows and changing the 
| 
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We see that the second determinant can be obtained from the first one by 
putting 2 for * (4 =0,1... ). The first determinant we shall denote by V,,, and 


the second by V,(a). Now the equation of the n-th regression-parabola can be 
written as follows: 


y= “Bod AA, sine (32). 
Applying the same method to (22), we easily find the expansion of H,,, 
namely, 
2 V? V3 ‘ 
= oy ES + aA, + eee + (33). 


As the calculation of higher parabolas and correlation ratios needs the know- 
ledge of the higher moments of the sampled population, the above results will be of 
interest in theoretical research, where the higher moments may be supposed to be 
known. Such a case arises, for instance, when we consider the dependency of the 
standard deviation upon the mean of a sample. 

The expansion of P,, can be used to find the necessary and sufficient conditions 
which the constants \ and yw must satisfy in order that the actual regression line 
may be represented by an n-th order parabola. 


represents the actual regression line. Then, using (24), we find 


1 Bos a Bn 
m= | Yat" p (a) dx =— Mis Ma 


n 


An> Pny +++ Mon 
where the coefficient of fn4, 1s not zero, and this is the necessary condition 
required. Assuming that the expansion (32) is permissible, we shall now prove 
that this condition is sufficient. In fact, assume that all the ),’s are linear 
functions of wz, represented by (35). Considering V,,,,, where s>1, 
we see that the terms of the first line of this determinant can be represented by 
the same linear and homogeneous function (35) of corresponding terms of the 
n+ 1 following lines of the same determinant. We conclude that-V,,,=0. Now 
if we consider V, and if we multiply the terms of its t-th row by the coefficient 
Of peyt1-2 (t= 2, 3...n+1) in (35) and subtract the products from the terms of 
the first line, we shall see that owing to (35) the determinant V,, will be trans- 
formed into the product of A, into the coefficient of yw», in (35), both of them 
being different from zero. Therefore V,+0. As the coefficient of 2” in V, (2) is 
An. + 0, we see that if all the A’s satisfy the condition (35), where the coefficient of 
Mn+x 18 not zero, the actual regression line is represented by an n-th order parabola. 
The general condition established can be considered as the generalisation of 


respective formulae given by K. Pearson for n < 4 (loc. cit.), and by A. A. Tchouproff * 
for n<2. 


“ Grundbegriffe u. Grundprobleme der Korrelationstheorie, Teubner, 1925, pp. 48—49. 
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ON THE PROBABLE ERROR OF THE MODE OF 
SKEW FREQUENCY DISTRIBUTIONS, 
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INTRODUCTION. 


THIS paper consists of four Chapters. The first treats of the deduction 
of a formula giving the probable error of the mode of Pearsonian curves as a 
function of 8, and ~,, the second deals with the requisite tables, the third 
represents the probable error of the mode as a surface, while the fourth Chapter 
is concerned with statistical applications. In spite of the importance of the 
mode no general investigation of the accuracy with which it can be determined 
has hitherto been made, although Professors Pearson and Filon* have indi- 
cated by a very different method its determination for special curves. I have 
endeavoured in this paper to find a formula for the probable error of the mode, 
to make a table of it for an adequate range, to draw a contour diagram and to 
make a model of the surface in order to visualize the whole field. The contour 
diagram provides facilities for finding an approximate value; a more accurate 
value can be obtained by interpolation from the table itself. 


* Phil. Trans. Vol. 191 (1898), pp. 229—311, 
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CHAPTER I. DEDUCTION OF A FORMULA TO CALCULATE THE PROBABLE ERROR 
OF THE MODE. 


Let the differential equation of the frequency curve be assumed to be given by 
the form : 


(1) 
Multiplying (1) by yas (Cy + + dx 
and integrating throughout the range b, to b, we get 
| (Cy + dy = — y (att + amt) dat (2). 
1 


Integrating the left-hand side of (2) by parts and remembering that usually 
the expression ya* (c, + ¢,# + c,#*) vanishes at both ends of the range we get: 


by by by 
— 50 | ay da — yda 


b, 
=-| a de (3), 


or with the usual notation for the moment coefficients 


+ (8 +1) + (8 +2) = + (4). 
Hence, the origin being any fixed point, we have for: 


s=0, Cy fly + 2Copy’ = py’ + Apo, 


a= I, Co + pty’ + Bey = pig (6), 
s=2, Woo ply BC; ple + = flys (7), 
s=3, + 40, + 5 Ce pig’ fg (8). 


Eliminating cy, c, and c, from (5), (6), (7) and (8): 


= VB poy’ + + pos! — 20 py’? + + Dye’? py’ 


LO pg, — LO pry’ — 1 + poo’ py’ — — 18 + 
Denote the distance of the mode from any fixed origin by MV: 


Now assuming the sample so large that statistical differentials may be replaced 
by mathematical differentials we reach by differentiating (10): 


Su, {Bs — 18p, +120," 10 Ga,’ 
( N D 
4 


N 
| 


KazuTaRo YASUKAWA 265 


N 


+ 32 — 54g," + 


D 
| — — 8 + 40 By’ — 


where all the ’’s are for the sampled population ; we shall indicate by rules over 
our symbols that they are the values of the corresponding quantities for this 
population. If now we refer to the mean of the sampled population as our fixed 
origin we shall have g,’=0 and 


D = — 12,2 — = (5B. — 6B, — 9) (13), 
(14). 
D 2 (5B, —6B,- 9) 
Introducing (12), (13) and (14) into (11): 
Mo |B.+3 58,-68,-9) WB 58,-68,-9 

(B.4+3 58,-68,-9) J/B(B,4+3) 58.—6B,-9 


8B, — 138.+9 16B, (B. +3) 
5B.—6B8,-9  (58,—6B, — 


| B.+8)(5B.- 20) 


6B,— 9° 
1 B.+ 3 + 128, (Bo+ 3) 
(58,-68,-9 (58,—6B, — 

Now if x be the number of individuals in the sample and we represent by 

curled brackets the mean value for all possible samples of the expression enclosed, 


o 


then: 
=< (17), = (Ba — 1) (18), 
Bs? B Bi Bs BS Be o 
Biometrika 18 


| 
& 
| 
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n n B, 


...(26), 
since pz, = 0 for the sampled population. 
Further let B,=3 138, + (Bs 
5B. — 6B, — 
— 9 (5B.—6B,-9)) 
Then we may write: 
SM = Sy,’ B, + JB, Bs + B, 
Square, sum for all possible mee divide “ their number, and we have: 
oy ~ {Be + Bi 1) B? + (B, B:) B? (By B.?) B? 2B, B.B, 
2B, 2B, B,B, + 2B. B,; (Bs 2B, (B, B:) 
+ 2B, B,(B; — Bi (28). 
Hence we deduce that : 
Ratio of the probable error of the mode to the probable error of the mean 
{B? ¥ 1) BY (B, B? + (Bs B.) B? + 2B, BB, 
+ 2B, B, (B; — B,B.) + 2B, B.B, + 2B, B,B; + 2B.B, (B; — 
+ 2B,B,(B,— ...... (29). 


CHAPTER II. TABLING OF THE PROBABLE ERROR OF THE MODE. 


(1) Explanation of Tables. It will be clear from this result that the deter- 
mination of the probable error (or of the standard deviation) of the mode for each in- 
dividual case would involve a large amount of computation. In order to avoid this 
it is necessary that the purely algebraical formula (29) shouid be replaced by tables. 
An examination of the values of B,, B,, B, and B, shows that they involve only 
8, and £., but the presence of the factor 1/(5B. - 68,—9) indicates that the 
probable error of the mode will become excessive in the neighbourhood of the 
line 58, -68,-9=0. But this is the “Rectangular Line” passing through the 
“ Rectangular Point” 8, =0, 8,=1'8, and is the so-called “axis” of the biquadratic 
which encloees J-curves and separates limited range curves from U-curves. When 
we get curves falling into the J-area our formula must break down, for the curve 


| 

| | 
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starts either with a finite or an infinite ordinate, which in a different sense is then 
the “mode.” The only variation in the position of this maximum ordinate is that 
due to change in the terminal of the range from sample to sample; but this is not 
given by our formula, which now gives a spurious mode lying outside the curve 
range. On the border of the biquadratic curve which encloses true J-curves the 
spurious mode becomes real. Along the “rectangular line” itself the curve 
suddenly changes from a right-handed J to a left-handed €, and under such cir- 
cumstances the spurious mode, as we approach the rectangular line, swings across 
and its probable error becomes infinite. This is the meaning of the factor 
1/(58. — 68, —9) in the B-terms, but the interest of this spurious mode is purely 
mathematical and not statistical, The biquadrate bounding the area of the 
spurious mode is not shown on Diagram I for it is too crowded; it passes near to 
contour 15 above and enclosing the asymptote reaches to about contour 7 below it. 

Passing now to a further point we notice that B,, B;, B; and 8, occur in the values 
of the mean powers and mean products of the moment coefficients. It was, 
therefore, needful to compute these, or to use the values given in Biometrika, 
Vol. vu. by A. J. Rhind for these quantities, of course assuming always that 
a Pearson frequency curve might be used. But on examination it was found 
(a) that there were certain errors in Rhind’s Tables, and (b) what is more im- 
portant, that smaller intervals of the argument 8, were needful. Accordingly new 
Tables of the values of §;, 8,, 8; and 8, for given values of 8, and ®, were com- 
puted from the difference formulae : 

Bog = (28+ 1) (1 + $a) Bos o}/{1 — (28 — 1) a}, 
Borin = (28 +2) (48, (1 + 40) — (28) a}, 

where == (28, — 3B, —6)/(BatB) (30). 


(2) The calculation of this new Table I, which the author believes will be of 
service for other purposes, occupied several months of strenuous labour. 


The next stage in the work was the computing of B,, B,, B, and B,; their 
values are not reproduced here, because it is not clear that they will be of service 
for other investigations. The values of the standard deviations and mean products 
are so easily obtained from Equations (17)—(26) that it does not seem worth 
while providing tables of them, although they occur frequently in practical work. 

(3) By aid of the Tables of the §’s and the Bs Table II giving the ratio 
of the probable error of the mode to that of the mean was constructed. It 
will be noted that in the final formula (29) there is no occurrence of J B,, which is 
found in (21), (23), (24) and (25), but drops out on the squaring of 6M and the 
substitution of the products of the moment coefficients. 

I suggest that the Table now provided will enable future workers to give the 
standard deviation of the mode with the same ease as in the past the standard 
deviation of the mean has been given, and thus to complete the specification of the 
constants of skew frequency distributions for large samples. I recognise that my 
work does not enable one to determine the frequency distribution of the mode, but 
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MAW 


WA 


Co 


SGOVWS 


Co 


VA 


“41440 
"44892 
“48493 
*52250 
“56169 
*60257 
64522 
68971 
*73612 
°78455 
*83508 
*88781 
"94286 
1°00032 
1°06033 
1°12300 
1°18847 
1°25690 
1°32842 
1°40321 
1°48145 
56333 
"64906 
‘73885 
83294 
“93160 
‘03510 
14375 


*25787 


*78154 

84727 

‘91579 

‘98720 
1-06162 
1°13918 
1°22000 
130423 
1°39200 
1°48348 
1°57882 
167821 
1°78182 
1°88985 
2°00250 
2°12000 
2°24258 
2°37049 
2°50400 
2°64339 
2°78897 
2°94105 
3°10000 
3°26618 
3°44000 
3°62189 
3°81231 
401176 


4°22080 


4°44000 
4°67000 


1°10667 | 1°39429 1°64888 1°87200 | 2°06839 
1°20075 | 1°51422 1°79186 2°03730 2°25370 
1°29873 | 1°63902 1°94118 2*20909 2°44615 
140077 | 1°76889 2°09643 2°38759 2°64600 
1°50701 | 1°90400 2°25783 2°57302 2°85348 
1°61763 | 2°04456 2°42561 2°76565 3°06886 
1°73280 | 2°19077 2°60000 2°96571 3°29241 
1°85270 | 2°34286 2°78125 3°17349 3°52442 
1°97753 | 2°50105 2°96962 3°38927 3°76518 
2°10750 | 2°66560 3°16538 3°61333 4°01500 
2°24282 | 2°83676 3°36883 3°84600 4°27422 
2°38371 | 3°01479 3°58026 4°08759 4°54317 
2°53043 | 3°20000 3°80000 | 4°33846 4°82222 
2°68324 | 3°39268 4°02838 4°59896 5°11175 
2°84239 | 3°59314 4°26575 4°86947 5°41215 
3°00818 ; 3°80174 4°51250 5°15040 5°72385 
3°18092 | 4:01882 4°76901 5°44216 6°04727 
3°36094 | 4°24478 5°03571 5°74521 6°38289 
3°54857 | 4:48000 5°31304 6-06000 6°73120 
3°74419 | 4°72492 5°60147 6°38704 7°09270 
3°94820 | 4°98000 | 5°90149 6°72686 7°46795 
4°16100 | 5°24571 6°21364 7°08000 7°85750 
4°38305 | 5°52258 6°53846 7°44706 8°26197 
4°61483 | 5°81115 6°87656 782866 8°68200 
4°85684 | 6°11200 7°22857 8°22545 9°11826 
5°10964 | 6°42576 7°59516 8°63815 9°57147 
5°37382 | 6°75310 7°97705 9°06750 | 10°04239 
5°65000 | 7°09474 8°37500 | 9°51429 | 10°53182 
5°93887 | 7°45143 8°78983 9°97935 | 11:04062 
6°24115 | 7°82400 9°22241 } 10°46360 | 11°56969 
6°55765 | 8°21333 | 9°67368 | 10°96800 | 12°12000 
6°88920 | 8°62038 | 10°14464 } 11°49356 | 12°69258 
7°23673 | 9°04615 | 10°63636 | 12-04138 | 13°28852 

9°49176 | 11-15000 | 12°61263 | 13-90900 

9°95840 | 11°68679 | 13°20857 | 14°55525 

12°24808 | 13°83055 | 1522862 

12°83529 | 14°48000 | 15°93053 

= — 15°15849 | 16°66250 

1586769 | 17°42618 

18°22333 

19°05585 


ee 


* In the values given for 8; and Be 


complete reliability beyond the eighth figure is not to be de- 


e 
| | 
| 
| 
| 
Bs. 
| 
pended on. 
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TABLE I—(continued). 


Values of Bs. 

1:0 1°2 1°3 1*4 1°5 
2°24000 2°38909 2°51765 2°62743 2°72000 2°79676 2°85895 2°90769 1°8 
2°44379 2°61000 2°75446 2°87904 2°98542 3°07509 3°14938 3°20948 1°9 
2°65532 2°83918 3°00000 3°13981 3°26038 3°36330 3°45000 3°52174 2°0 
2°87484 3°07688 3°25455 3°41000 3°54514 3°66167 3°76108 3°84474 21 
3°10261 3°32337 3°51837 3°68990 3°84000 3°97047 4°08291 4°17876 2°2 
3°33890 3°57894 3°79175 3°97980 4°14524 4°29000 4°41578 4°52411 
3°58400 3°84387 4:07500 | 428000 4°46118 4°62057 4°76000 4°88108 
3°83820 4°11848 4°36842 | 4:59082 4°78812 4°96250 5°11589 5*25000 
4°10182 4°40308 4°67234 4°91258 5°12640 5°31612 5°48377 5°63119 26 
4°37517 4°69800 498710 | 524562 5°47636 5°68176 5*86400 6°02500 27 
4°65860 5°00360 5°31304 5°59032 5°83837 6°05980 6°25692 6°43178 2°8 
4°95247 5°32023 5°65055 5°94702 6°21278 6°45060 6°66291 6785189 2°9 
5°25714 5°64828 6°00000 6°31613 6°60000 6°85455 7708235 7°28571 3°0 
5°57301 5°98814 6°36180 6°69804 7°00042 7°27204 7°51564 7°73365 
590049 6°34024 6°73636 7°09319 7°41447 7°70351 7°96320 8°19612 3°2 
6°24000 6°70500 7°12414 7°50200 7°84258 8°14938 8°42545 8°67353 3°3 
6°59200 7°08289 7°52558 7°92494 8°28522 861011 8°90286 9°16634 
6°95696 7°47439 7°94118 8°36250 8°74286 9°08617 9°39588 9°67500 
7°33538 7°88000 8°37143 8°81517 9°21600 9°57806 9-90500 | 10°20000 3°6 
7°72779 8°30025 8°81687 9°28349 9°70517 | 10°08630 | 10°43074 | 10°74184 37 
8°13474 8°73570 9°27805 9°76800 | 10°21091 | 10°61143 | 10°97362 | 11°30103 3°8 
8°55680 9°18692 9°75556 | 10-26929 | 10°73379 | 11°15400 | 11°53419 | 11°87812 39 
8°99459 9°65455 | 10°25000 | 10°78795 | 11°27442 | 11°71461 | 12°11304 | 12°47368 4°0 
9°44877 | 10°13921 | 10°76203 | 11°32463 | 11°83341 | 12°29386 | 12°71077 | 13°08830 4°1 
9-92000 | 10°64160 | 11-29231 | 11°88000 | 12°411423 | 12°89241 | 13°32800 | 13°72258 42 
10°40901 | 11°16243 | 11°84156 | 12°45475 ! 13°00916 | 13°51093 | 13°96539 | 14°37717 43 
10°91657 | 11°70247 | 12°41053 | 13°04962 | 13°62732 | 14°15012 | 14°62364 | 15°05275 YA 
11°44348 | 12°26250 | 13°00000 | 13°66538 | 14°26667 | 14°81071 | 15°30345 | 15°75000 45 
11°99059 | 12°84338 | 13°61081 | 14°30286 | 14°92800 | 15°49349 | 16°00558 | 16°46966 46 
12°55881 | 13°44600 | 14:24384 | 14°96289 | 15°61215 | 16°19927 | 16°73082 | 17°21250 
13°14909 | 14°07130 | 14-90000 | 15°64640 | 16°32000 | 16°92889 | 17°48000 | 17°97931 4s 
_ 13°76246 | 14°72029 | 15-58028 | 16°35432 | 17°05247 | 17°68325 | 18°25398 | 18°77093 49 
14°40000 | 15°39403 | 16°28571 | 17°08767 | 17°81053 | 18°46329 | 19°05366 | 19°58824 50 
15°06286 | 16°09364 | 17°01739 | 17°84750 ; 18°59520 | 19°27000 | 19°88000 | 20°43214 5°1 
15°75226 | 16°82031 | 17-77647 | 18°63493 | 19°40757 | 20°10442 | 20°73400 | 21°30361 5°2 
16°46951 | 17°57531 | 18°56418 | 19°45143 | 20°24877 | 20°96763 | 21°61671 | 22°20366 | 
17°21600 | 18°36000 | 19°38182 | 20°29739 | 21°12000 | 21°86080 | 22°52923 | 23°13333 a4 
17°99322 { 19°17581 | 20°23077 | 21°17500 | 22°02254 | 22°78513 | 23°47273 | 24°09375 5°5 
18°80276 | 20°02426 | 21-11250 | 22°08537 | 22°95771 | 23°74192 | 24°44842 | 25-08608 56 
19°64632 | 20°90700 | 22-02857 | 23°03000 | 23°92696 | 24°73250 | 25°45760 | 26°11154 5°7 
20°52571 | 21°82576 | 22-98065 | 24°01046 | 24°93176 | 25°75831 | 26°50162 | 27°17143 5°8 
21°44291 | 22-°78241 | 23-97049 | 25°02844 | 25°97373 | 26°82086 | 27°58192 | 28-26711 5:9 
_ 23°77895 | 25-00000 | 26°08571 | 27°05455 | 27°92174 | 28°70000 | 29°40000 6°0 
_ 24°81750 | 26°07119 | 27°18419 | 28°17600 | 29°06265 | 29°85746 | 30°57162 6:1 
_ _ 27°18621 | 28°32590 | 29°34000 | 30°24537 | 31°05600 | 31°78356 6°2 
— _ 28°34737 | 29°51300 | 30°54857 | 31°47182 | 32°29739 | 33°03750 6°3 
— — a= 30°74780 | 31°80387 | 32°74400 | 33°58353 | 34°33521 64 
— — = 32°03276 | 33°10820 | 34°06406 | 34°91642 | 35°67857 6:5 
— 34°46400 | 35°43429 | 36°29818 | 37°06957 6°6 
35°87390 | 36°85710 | 37°73108 | 38°51029 6°7 
— — — 38°33508 | 39°21750 | 40°00299 6°8 
— — 39°87100 | 40°76000 | 41°55000 69 
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270 Probable Error of the Mode of Skew Frequency Distributions 
TABLE I—(continued). 


Values of 


| 3°85714| 4°07037| 4°20199| 4:26974| 4:28739| 4:26566]  4°13599 
1:9 | 4:40244] 4°64442| 4°79441 487269 | 4°89492| 4°87323| 4°81715| 4°73416 
2-0 | 5-00000| 5:27357| 5°44352| 5°53307} 5°55998! 5°53802] 5°47791 5*38807 
2-1 | 565885 | 5°96205| 6°15366| 6:25517] 6°28681 6:26413 | 6°19921] 6710151 
2-2 | 6°36842| 6°71453| 6°92954| 7:04376| 7:08000| 7°05602| 6°98537| 6°87863 
23 714865 | 7°53619| 7°77640| 7:90384] 7°94461 7°91858 | 7:84109| 7:°72393 
2-4 | $:00000| 8:43274| 8:70000} 884126] 8°88615| 8°85714| 8°77143] 864228 
2-5 | 892857 | 9°41053| 9°70672| 9°91071| 9°87753! 9°81065| 9°63899 
26 | 9°94118 | 10°47664 | 10°80364| 10°97368| 11°02497| 10°98615}| 10°87892} 10°71985 
2-y | 1104545 | 11°63897 | 11°99861 | 12°18325 12°23629; 12°19003| 12°06882| 11°89115 
2-8 | 12°25000 | 12°90637 | 13°30040 | 13:49946| 13°55280| 13°49688 | 13°35901| 13°15981 
2-9 | 13°56452 | 14:°28876 | 14°71881 | 14°93176| 14°98353| 14°91525| 14°66914| 14°53334 
3-0 | 15:00000 15°79734 | 16°26482 | 16°49068| 16°53846| 16°45455| 16°27321| 16°02004 
3+1 | 16°56897 | 17°44479 | 17°95077 | 18°18802 |} 18°22872|] 18°12523] 17°91578| 17°62896 
3-2 | 18°28571 | 19°24547 | 19°79058 | 20-03698 | 20°06672! 19°93875| 19°69601| 19°37010 
33 | 20°16667 | 21°21577 | 21°80000 | 22°05242 | 22-06630| 21°90809| 21°62578} 21°25445 
3:4 | 22°23077 | 23°37444 | 23°99693 | 24-25111 | 24°34303| 24°04755 | 23°71830| 23°29418 
3-5 | 24°50000 | 25°74304 | 26°40178 | 26:65201 | 26°61435| 26°37315 | 25°98826 | 25°50270 
3-6 | 27-00002 | 28°34641 | 29°03790 | 29°27665 | 29:20000| 28°90287| 28°45200| 27°89494 
3-7 | 29°76087 | 31:21363 | 31°93214} 32°14962 | 32-°02226| 31°65688| 31°12780| 30°47745 
| 32°81818 | 34°37829 | 35°11555 | 35°29902| 35°10645 | 34°65795| 34°03614]| 33°29869 
| 36°21429 | 37°88011 | 38°62415 | 38°75716} 38°48143| 37°93182]| 37°20000] 36°34923 
4°0 4000002 | 41°76595 | 42°50000 | 42°56138 | 42°18020} 41°50769| 40°64530| 39°66217 
41 | 4423684 | 46-09165 46°79249 | 46°75499 |} 4624069 | 45°41884| 44°40133 | 43-26338 
B.. | 42 | 49°00000 | 50°92418 | 51°56000 | 51°38852 | 50°70667 | 49°70330 | 48°50130| 47°18203 
43 | 54°38235 | 5634460 | 56°87196 | 56°52129| 55°62888| 54°40472| 52°98300} 51°45104 
4:4, | 60°50000 | 62°45189 | 62°81163 | 62°22222| 61-06648 | 59°57347| 57°78962| 56°10775 
69°36821 | 69°47964 | 68°57759 | 67-08882| 65°26786| 63°27068 | 61°19457 
46 77°24602 | 76°99870 | 75°68397 | 73°77764| 71°55580} 69°18323| 66°75995 
= = 85°52000 | 83°66259 | 81:23000| 78°51680| 75°69329 | 72°85939 
48 = — 95°23182 | 92°65977 | 89°56191 | 86°24448 | 82°87765 | 79°55676 
49 102°85541 | 98°91311 | 94°84982 | 90°82608 | 86°92588 
50 = — 114°47304 | 10945344 | 104:46524 | 99°64417| 95°05252 
51 — 121°39118 | 115*25000 | 10945684 |. 104-03682 
5:2 - 13498447 | 127°39721 | 120°41296 | 113°99636 
53 — 141°14317 | 132°69118 | 125-07009 
| 156°78005 | 146°50770 | 137°42323 
55 | = 162°12647 | 151°25376 
56 a5 = 179°87296 | 166°80075 
58 — — | — — 20427568 
66 — — — 
6-7 _ — = = - 
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TABLE I—(continued). 
Values of B,. 
Ai. 

8 ‘9 1-0 11 1:2 13 15 
4*04000 3°92924| 3°80711| 3°67634| 3°53913| 3-39727| 3-25220| 3-10510| 1°8 
4°63021 | 4°51000| 4°37733 | 4-:08613| 3-93202| 3°77448| 3°61479| 1-9 
5°27513 | 5°14437| 5-00000| 4°68319| 4°51562| 4°34441| 4-17097| 2-0 
5°97844 | 5°83588 | 5°67849| 5°51000| 5°33338| 5°15101| 4:96481| 4°77632| 2:1 
6°74410 6°58831) 6-41643| 6-23257| 6:04000| 584134] 5°63869| 5-43372| 2-2 
757640 7:01679| 680660} 6°59000| 6°36928| 614625 | 2:3 
8°48000 | 829265 | 8-08654| 7:86667| 7°63697| 7:40059| 7-16000| 6-91718| 24 
9°45993 | 9°25376| 9°02746| 8-78650| 853521; 827699} 8-01455| 7-75000| 25 
10°52168 | 10°29429| 10-04538| 9-78094| 9°50570| 922337] 8-93686| 8-64846| 2-6 
11°67120 | 1141986 | 11°14560| 10°85499| 10°55318| 10°24419| 9-93116| 9°61657| 2-7 
12°91498 | 12°63657 | 12-33387| 12°01406| 11°68273| 11°34426| 11°68273| 10°65861 | 2-8 
14°26010 | 13°95159 | 13-61641 | 13-26400| 12-89988| 12°52874| 12°15424| 11-77918| 2-9 
15°71429 | 15°37056 | 15-00000 | 14°61114| 14°21053| 13°80322| 13°39311| 12°98319| 3-0 
17°28600 | 16°90296 | 16-49198 | 16°06234| 15°62111| 15°17370| 1472426] 14-27595 | 3-1 
18°98452 | 18°55685 | 18+10036 | 17°62506| 17°13857| 16°64667| 16°15375| 15-66314| 3-2 
20°82000 | 20°34167 | 19°83386 | 19°30739| 18°77043 18-22912| 17-68811| 17°15037| 3-3 
22°80364 | 22°26770 | 21-70199| 21-11814| 20°52486| 19°92866| 19°33441] 18°74572| 34 
24°94775 | 24°34628 | 23°71517 | 23-06690 | 22°41071 | 21°75348| 21°10026| 20°45478 | 3-5 
27°26593 | 26°58983 | 25°88479| 25°16414| 24-43765| 23°71249| 22-99392| 22-28571 | 3-6 
29°77323 | 29°01036 | 28°22335 | 27°42130] 26°61616 | 25°81537| 25-02430! 24-24676 | 3-7 
32°48632 | 31°62803! 30°74459 | 29°85091 | 28°95770| 28-07261| 27:20107| 26-34686 | 3:8 
35°42369 | 34°45448 | 3346360 | 32°46669| 31:47479| 30°49567| 29°53475| 28-59568 | 3:9 
38°60597 | 37°50988 | 36-39706 | 35-28370| 34°18110| 33-09702| 31-00367 | 4:0 
42°05617 | 40°81478 | 39°56337 | 37-94426| 37:09164| 35°89027] 34-71942| 33°58221 | 4-7 
45°80000 | 44°39200 | 42-98291 | 41°58947 | 40-22286| 38-89031 | 37°59636| 36-34362| 4-2 
49°86636 | 48°26703 | 46°67828| 45-11667| 43°59285| 42°11346| 40°68231| 39-30132 | 4-3 
54°28773 | 52°46836 | 50°67464 | 48-92245 47-22155| 45°57759| 43-99340| 42-46993 | 4-4 
59°10079 | 57°02793 | 55-00000| 53-03157| 51°13095| 49°30236| 47°54727| 45-86538 | 4-5 
64°34706 | 61°98164 | 59°68567 | 57-47152| 55°34637| 5330941 | 51-36330] 49°50509 | 4-6 
70°07371 | 67°37000 | 64°76674 | 62-27294| 59°89174| 57°62259| 55-46274| 53-40809| 4-7 
76°33454 | 73°23883 | 70°28261| 67-47004| 64°80000| 6226828 | 59°86897| 57:59525 | 4-8 
83°19114 | 79°64013 | 76:27770| 73°10111| 70°10345| 67:27568| 64-60777| 62-08948 | 4-9 
90°71429 | 86°63316 | 82-80220| 79:20913| 75°83926| 72°67717| 69°70761| 66°91597 | 5-0 
98°98572 | 94°28567 | 89-91304| 8584250] 82-04900| 78°50882| 75-20000| 72°10244| 
10810025 | 102°67545 97-67502 | 93-05590| 88°77931 | 84-81081 | 81°11987| 77-67953 | 5:2 
11816846 | 111°89225 | 106°16215 | 100°91122 | 96-08264 | 91-62805 | 87-50607| 83-68109 | 5:3 
129°32001 | 122°04000 | 115-45933 | 109-47890 | 104-01818 | 99-01089} 94-40185 | 90°14466 | 5-4 
141°70781 | 13323978 | 125-66434 | 11883929 | 112°65295 | 107-01593 | 101°85554 | 97-11190 | 55 
155°51348 | 145°63333 | 136°89033 | 129-0844 | 122-06304 | 115-70695 | 109-92127 | 104-62922 | 5-6 
170°95414 | 159°38758 | 149°26891 | 140°32027 | 132°33522 | 125-15610 | 118-65982 | 112°74843 | 5-7 
188°29143 | 174°70041 | 162-95392 | 152°66919 | 143°56876 | 135°44525 | 128-13965 | 1291-52747 | 5:8 
207 °84335 | 191°80793 | 178°12626 | 166-27335 | 155°87777 | 146°66761 | 138-43810 | 131-03138 | 5:9 

- 210°99415 | 195-00000 | 181-29870 | 169-39394 | 158-92977 | 149-64286 | 141-33333 | 6-0 

232°60341 | 213-83011 | 197-94009.| 184-27000 | 17235402 | 161-85366 | 152°51594 | 6-1 

23492263 | 216742764 | 20068400 | 187-08139 | 17518440 | 164-67274 | 6-2 

-- = 258°64778 |,237°03500 | 218°84465 | 203-27519 | 189°76568 | 177-91006 | 6-3 

_ — -- 260°08994 | 238-99803 | 221°12555 | 205-74781 | 192°34915 | 6-4 

285°98810 | 261-43629 | 240-85495 | 223-30470 | 208-12889 | 6-5 

286°50857 | 262°72535 | 242-63850 | 225-40894 | 6-6 

314°63551 | 287-04710 | 263-98543 | 244-37376 | 6-7 

= -- 314°19047 | 287-62313 | 265-23750 | 6:8 

344-60055 | 313-88000 | 288-25000 | 6:9 
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TABLE I—(continued). 
Values of B;. 


Ay. 


BSW 


& 


> Co % 


2 3 5 6 
1°41563 | 2°55335| 3°45274| 414938 467447 5°05504 
168269 | 3°03698! 4°11024 494488 5°57 799 604158 
199086 | 3°59423| 486684 5°85930 6°61567 717383 
2°34609 | 4°23542| 5°73605 690840 780484 8°47016 
2°75531 | 4°97250| 6°73343 811022 9716522 995138 
3°29646 | 5°81933 7°87680 9°48543 | 10°71931| 11°64114 
3°76936 | 6°79200| 9°18694| 11°05781| 12°49286| 13°56632 
4°39479 | 7°90935| 10°69172| 12°85476| 14°51538| 15°78319 
5°11605| 9°19345| 12°40745' 14°90800| 16°82084] 18°25025 
5°94878 | 10°67033| 14°37849| 17°25435; 19°44838]  21-08439 
691168 | 12°37077| 16°63931| 19°93669| 22°44332|  24°30640 
8°02724| 14°33140| 19°23517| 23°00517| 25°85827| 27°87491 
9°32265 | 16°59605 | 22°21964| 26°51868| 29°75455| 32°13621 
10°83106 | 19°21744} 25°65651| 30°54669| 34°20394| 36°87766 
12°59314 | 22°25945| 29°62217| 35°17151|. 39°29040| 42-27778 
14°65928 | 25°80000| 34°20857| 40°49118| 45°11360| 48743448 
17°09242 | 29°93481| 39°52721| 46°62315| 51°79124! 55*46278 
19°97203 | 34°78258| 45°71414| 53°70889| 59°46367| 63°49841 
23°39961 | 40°49179| 52°93674| 61°92000| 68°29905} 72°70228 
27°50649 | 47°25012| 61°40265| 71:46601| 78°50005| 83°26613 
32°46494 | 55°29753} 71°37178| 82°60477} 90°31249| 95°41972 
3850483 | 64°94474 | 83°17295| 95°65633 | 104°03669 | 10944000 
45°93863 | 76°60000| 97:22686| 111°02153] 120°04231| 125°66296 
55°20045 | 90°80847 | 114-07864 | 129°20765| 138°78825| 144-49908 
66°90894 | 108°31200 | 134°44499 | 150°86399 | 160°84945 | 166°45371 
81°97288 | 130°14217 | 159°28370 | 176°83235 | 186°95369| 192°15427 
101°77708 | 157°77045 | 189°89859 | 208°21978 | 218°03271 | 222°14023 
128°52933 | 193°36018 | 228°10415 | 246°50617| 255°29464| 25814675 
165°95745 | 240°21033 | 27648965 | 29370715 | 300°32869 | 30070801 
303°58400 | 338°86359 | 35262801 | 355°25957 | 351°72287 
392°37271 | 421°03600 | 427-27315| 422°98141| 413°36471 
532°27910 | 523°53374 | 507°52256| 488-5393 
688°26156 | 650°39838| 614°63369 | 581+20426 
82221327 | 752°77500| 696°86837 
~ — 1063°22283 | 934°85457| 843°39459 
~- ia 1181-47331 | 1032-34962 
— 152744262 | 1281-37852 


5°31428 
6°36300 
756603 
8°94237 
10°51347 
12°30359 
14°34023 
16°65462 
19°28231 
22°26384 
25°64561 
29°48081 
33°83063 
38°76566 
44°36758 
50°73124 
57°96718 
66°20474 
75°59586 | 
86°30244 | 
98°58916 | 
112°65722 | 
128°82752 | 
147°46592 | 
169°01630 | 
194:02097 
223°14757 
257°22494 
297 °29164 
344°66229 
401°01993 | 
468°54685 | 
550°11337 
649°55652 
772°10198 | 
925°02170 
1118°69384 
1368°38182 | 
169736503 
2142-78154 | 
2767 36099 | 
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TABLE I—(continued). 
Values of B;. 
Ay. 
| 8 9 10 1°2 13 15 
| 5*47200 5*54505 5°54774 5*49223 5°38882 521032 5°07194 4°87218 | 
6°66900 6°69008 6°64297 6°53966 6°39032 6°20353 598658 | 
7°81963 7°95777 8*00000 796281 7°86015 7°70376 7°50359 726805 | 2-0 
9°25357 9°43082 9-49691 9°47100 9°36924 9°20521 8-99040 8°73446 | 2-1 
| 10°88912| 11710996} 11°20249| 11-°18895| 11°08800|} 10°91533| 10°68412] 10°40546| 2:2 
12°75081 13°01968} 13°14100| 13°14055| 13°03987| 12°85700| 12°60708| 12°30280] 2:3 
14°86629| 15°18749| 15°33963| 15°35250| 15°25093| 15°05566| 14°78400| 14:45047| 24 
| 17°26684| 17°64442] 17°82888] 17°85464| 17°75028| 17°53956| 17°24229| 16°87500| 25 
| 19°98792|  20°42546| 20°64305| 20°68041| 20°57039| 20°34018| 20°01238! 19°60577]| 2-6 
| 23-06977 | 23°57017| 23°82076 23-86733 | 23°74757 | 23-49257 23°12805| 2267532 | 2-7 
26°55821 | 2712336 27-40557 27-45754| 27°32248| 27-03588| 2662694 | 2611977 | 28 
30°50551 | 31°13665| 31°44672| 31°34075| 31-01385| 30°55093| 29°97994| 29 
| 3497143 { 35°66579| 36-00000| 36°04377| 35°85359! 35°47542| 3494677; 34°29832] 3-0 
40°02452| 40°77897| 41°12872| 41°15378| 40°91866} 40°47551| 39°86670| 39°12670]| 3-1 
45°74361 | 46°55101| 46°90489| 46°89705| 46°60097! 46°07576| 45°36914| 44°51979 | 3-2 
52°21964| 53°06850| 53°41068| 53°35129| 52°97396! 52°34552| 51°51961| 50°53845| 3-3 | 
89°55782| 60°43103 60°73997 6060490 | 6012077 59°36297 | 58-39163) | 3-4 | 
| 67°88034| 68°75344| 69°00044! 68°75867| 68°13571| 67°21640} 66°06791| 64°74368 | 3-5 | 
| 77°32958| 78°16863| 78°31586| 77°92779| 77°12604| 76-00572| 74°64162| 73°09286| 3°6 | 
88°07212 | 88°82758|  88°82894|  88°24432| 8721400] 85°84425] 84:21798| 82°40021 | 3-7 | 
100°30364 | 100-91991| 100-70481} 9986000 | 9853927] 96°86080| 94-91600| 92°77596 | 3-8 
11425501 | 114°64607 |} 114°13512| 112°94981/| 111°26193| 109°20215| 106°87064| 104:34450| 3-9 | 
130°19998 | 130°25632} 129°34314| 127°71610| 125°56589| 123-03600| 120-23522| 117:24648 | 4-0 
148°46471 | 148-04207 | 146°59001 | 143°45076| 141°66316| 138°55442| 135°18444| 131°64135 | 4-7 
| 169°44000 | 168°34874 | 166°18242| 163°23621| 159°79886| 155°97808| 151-91782| 147°71022| 4-2 
193°59693 | 191°58787| 188°48207} 184°62341| 180°25740| 175°56124| 170°66381| 165°65935 | 4-3 
221°50693 | 218-25251| 213°91770| 208°87083| 203°36995| 197°59782| 191°68480| 185°72423| 4:4 
253°86810 | 248-93685 | 243°00000| 236744133 | 229°52365| 222-42874| 215-28302| 208°17445| 
291°53977 | 284:36158| 276°34066 | 267-85963| 259-17616 | 250°45088 | 241°80774| 233°31950]| 
335°58868 | 325-4U700 | 314°67673 | 303°75068| 292°85347 | 282°12801 | 271°66398| 261°51581 | 
387°35126 | 37315656 | 358-90198 | 34486280 | 331-20000| 318-00442 | 305-32303| 293°17509| 4-8 
448°51903 | 428°95489| 410°10784| 392-09743 | 374:96309| 358°72156 | 343°33557| 328-77466 | 4-9 
| 5291-25714 | 494°48636, 469°63736| 446°54746| 425-06230| 405-03908 | 386°34752| 368°86975 | 5-0 
608°37258 | 571°88229| 539°15698| 509°54711 | 482°57201| 457°86117| 435°12000| 414-10888 | 5-1 
713°55655 | 663°87024 | 620°75355 | 582-73768| 548°81752| 518°26993| 490°55405 | 465°25256 | 5-2 
841°74254 | 773°98556 | 717°06701 | 668°15520| 625:41017 | 587°56812| 553°72181} 523°19655 | 5-3 
999°64805 | 906°87600| 831°47510| 768°34849| 714°33162| 667°33415| 625°90596 | 589-00103 | 5-4 
1196-61837 | 1068°75185 | 968°35433 | 886°54107| 818°03932| 759°49378 | 708°65022| 663°92692 | 5:5 
1445°98797 | 1268-06663 | 1133°45664 | 1026°85589 | 939°60867 | 866°41460| 803°82428| 749°48239! 5-6 
1767°36192 | 1516-57773 | 1334°46556 | 1194-63332 | 1082-92777 | 991°03145 | 913°70857 | 847°48183 | 5-7 
| 2190°63315 | 1831-05543 | 1581-83626 | 1396-88998 | 1252-96795 | 1137-01614 | 1041-10567 | 960°12198 | 5:8 
| 2763-47753 | 223614894 | 1890-10532 | 1642-99492 | 1456-16614 | 1309-00889 | 1189-48840 | 1090-01813 | 5-9 
ema 2769°43027 | 2279-99999 | 1926-12857 | 1700-97663 | 1512-94076 | 1363-20000 | 1240°65000 | 6-0 
3490°80240 | 2781-97487 | 2322-69706 | 1998°68407 | 1756°48819 | 1567°72730 | 1415-90414 | 6-1 
pee - 3442-43442 | 2799-25452 | 2364-63286 | 2049-72980 | 1810°08041 | 1620-93898 | 6-2 
4335°33729 | 3412-44751 | 2820-14136 | 2406-11294 | 2099-32960 | 1862-18916 | 6-3 
4218°75744 | 3395-58295 | 2843-91584 | 2447-37773 | 2147-87250 | 6-4 
5308°17016 | 4135°55270 | 338852216 | 2870°10032 | 2488°61733 | 6-5 
5107°94743 | 4076-08374 | 3389-06759 | 2898°36583 | 66 
we 6420°90275 | 4959°65704 | 4034°22238 | 3395-70595 | 6-7 
6119°99250 | 4848-19168 | 4005°88251 | 6-8 
7685°64544 | 5893°51196 | 4763-92498 | 6-9 
re 7265°32153 | 5719°68219 | 7:0 
| 


| 
5, 
| 
| 
| 
! 
; 
\ 
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TABLE I—(continued). 
Values of B,. 


0 1 4 5 6 
900000]  11°37904 11°65931 11°57571 1124612 10°75489 1016245 
1:9} 11°26008| 13°23077 1424766 14*60515 14°51125 1411351 13°51686 12°79605 
2:0| 14:00000| 16°46165|  17°72924 18°17685 1806655 17°58286 16°85605 1597830 
21| 17°31490|  20°37698| 21°94568 22°49677 22°36051 21°76742 20°87902 19°80872 
22| 21°32037|  25°12011 2700353 27°71253 27°53558 26°80218 25°71216 24°40477 
26°15756|  30°98446| —33-20564 34-00193 33°76303 3284853 3150583 29°90542 
32°00000| 3778431 40°64836 41°58212 41°24961 40°10000 38°43902 36°47547 
25| 39°06250|  49°64275 50°74739 50°24583 48°78960 46°83136 44°31091 
26| 47°61300|  56°36215|  60°52268 61°73207 61-05669 59°19886 5660720 53°64551 
27| 55°66909| 68°73087| —73°70383 75-02652 74:05525 71°66945 68°44455 64°75897 
70°61765|  83°79934| 89°70741 91-09765 89°70029 86°61786 82°56030 77°98719 
2-9| 86°04990| 102°22137] 109-19212 110°56655 | 10855927 104°55444 98°96957 93°73497 
105-00000| 124°84101| —132-99779 134-21520|  131°33846| 126710769] —119°60133 11249293 
128-40949| 152°76061| 162-70552 16303622 158°92300| 152°05685| 14376373 13485446 
3°2| 157°53846| 187°43873| 198°22205|  198-30084| 19243054 183°37199 172°75506 161°54896 
33| 194:10417| 230°83408| 24290000] 23328402]  221-27382]  207°61816 19346858 
240°49650| 285°62144| 298°71190|  283°31160|  267°30520| 231°71559 
300712500! 355°52475| 369-00857| 361-91859|  344°88577| 32343535 | —300°50441 277 °65897 
3°6 | 378-00027| 445°84514| 458-40994|  445-49756} 421712000} 362°24145 333°01035 
| 481°75409| 564°33393| 573°41334|  551-18736| 516°15119| | 4437-52021 399°82427 
623°54548| 722°67424| 723°36878|  686-21072|  635°55341 581°87572 | 529°76495 481°13873 
3-9| 823°87500| 939°16548| 922-09827|  860-79199| 78695638 | 712°91403| 643-44000 580°15336 
4:0 | 1120°00044 | 1243°83108| 1190°70000| 1089-74754! 98099234  877°83511| —_784-43293 701°49500 
4°1 | 1586°99669 | 1688-93436 | 1562-65014 | 1395-14604} 1232°78583 | 1087°37952 960-60823 851-07448 
4°2 | 2401-00000 | 2372-80866 | 2093-67200} 1810-95208} 1564°37061 | 1356'06923| 1182°62312| 1036-70173 
Bz. | 4-3 | 4092-27183 | 3501741300 | 2882-34489] 2391-55352 | 2008-75892| 1706-85856 | 1465-15573! 1268-83033 
4°4 | 9317-00007 | 5580-42447 | 4117-65621 | 3228-42286 | 2617-09069 | 2169-64147 | 1826-36689] 1561-65790 
10228-34035 | 6204-67651 | 4485-37892} 3471-86981 | 2792-18750} 2303-06656] 1934-78712 
46 26594°29656 | 10169°64897 | 6481-22602} 4713°08621| 3648-13982] 2931°31183| 2415-79639 
19470-30398 | 9912-98937| 6594°13318| 4857-09743 | 3779°10320| 304433255 
48 56371°22036 | 16599-18193 { 9614-97614 | 6623-62278 | 4949°12279|  3878°84348 
49 32999-08240 | 14889-79442 | 9321-73705 | 660916370] 5008-09053 
5-0 _ 10769538628 | 2541319110 | 13699°35294| 9048-42104| 
— 52445°38896 | 21455°52501 | 12800°88518 8799°46863 
5-2 = _ 19887214895 | 37301-06052 | 18948:75669| 12089-95456 
53 80097-49338 | 29999°60559| 17186-73944 
ah “ 37290859530 | 53139°45765| 25617741462 
5S ~ — 119233-63920| 40995°34689 
56 = ~ 759289°60401| 74049-27108 
ore | 
== — — = 


436 
523°3 
628°4 
756°5 
913°¢ 
1107°2 
1347 
16484 
2028-2 
2512°7 
3138°] 
3956°7 
6523°1 
8575+ 
11510°¢ 
15866°¢ 
22666: 
34006°% 
55018*: 
101473" 
253760" 


8 
9°5 
12°0) 
a 15°0 
18°6 
| 
| 28°1 
34°3 
41°7 
60°8 
87°8 
105-2 
125°8 
150°3 
179°4 
214-2 
255°7 
305-2 
364-9 
a 
| = 


13 


9°51200 
12°00421 
15°01475 
18°63611 
22°97697 
28°16526 
34°35200 
41°71577 
50°46839 
60°86175 
73°19633 
87°83176 
105-20000 
125°82186 
150°32793 
179°48509 
214°23055 
255°71562 
305°36277 
364°94018 
43666063 
523°31347 
628°44297 
756°59211 
913°64000 
110727652 
1347-67763 
1648-49012 
2028°28450 
2512°75884 
3138°15849 
3956°74047 
5045°80000 
6523-18412 
8575°22779 
11510-03758 
1586664876 
22666°51071 
34006 36363 
| 55018°58014 
101473 "89624 
253760°75389 
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TABLE I—(continued). 
Values of By. 


9 1-0 1°2 
883433 815118 747777 6°82451 
1117900 10°34692 9°52638 8*72995 
14°01129 13-00000 1200305 1103545 
17°41688 16°18967 14-98100 1380869 
21°49603 2000831 18°54532 17°12800 
26°36630 24°56380 22°79495 2108421 
32°16574 29°98227 2784518 25°78270 
39°05687 36°41153 33°83049 31°34598 
47°23137 44-02511 40°90805 37°91659 
56°91597 53°02714 4926193 45°66075 
68°37949! 63°65838 59°10810 54°77260 
81°94402 76°20351 70°70058 65°47942 
97°98379 91-00000 84°33882 78-04766 
11696246!  108°44918|  100°37681 92°79065 
139°42608|  129°02981| —119°23414 110:07767 
166°03608| 153°31496| 130°34509 
197°59354|  181°99321 167°49870 154°11009 
235°07313|  215°89514!  198°21347 18198750 
279°66656|  256°02699| 234°41002 214-71069 
33282366) 303°61339| 277°12110 253°15751 
396°40001| 360715227) 327°59804 298°38281 
472°60352| 427°48564; 38736420 351°65936 
564°27028|  507°89191| 45828341 41452944 
674:95976|  604:20700| —538°52748 488°87120 
809°19917| 719°98521| 643-29726 57698306 
972°79761|  859°71417 763°76288 681°69350 
1173°28180| 1029°10666| 908°47666 80550403 
1420°50928| 1235°50000| 1083-03746 955°77730 | 
1727°54471| 1488741118} 1294°57784| 1135-00321) 
2111°93585| 1800°31916! 1552°26683| 1351-05386) 
2597°60657| 2187°78540| 1868-00882} 1612-80000 
3217°73134| 2673°08744| 2257°43040| 1931-57581 
4019°21171| 3286°65035| 2741-29638, 2322-13115 
5069°85289{ 4070°74603| 3347°57848| 2803-84815 
6470°26979| 5085°27254| 411454025] 3402-51244 
8374°45168| 6417°05252} 5095°44479| 4152-90618 
11027°01600| 8195°32568| 6365-92882] 5102-68429 
14834°80538| 10618°63416! 8035°91250| 6318°30392 
20514°98864| 14003°83814| 10269°52127| 7894°34108 
29432°15808| 18881-04131] 13319°83857| 9968-58982 
44471°78151| 26191°91258| 17592°66245| 12747-42898 
72771°12146| 37747°20230| 23770-94842; 16550-29979 
137288°89025} 57434°99984| 32803°61819| 21891-83037 
368137°95820| 95110°70470| 47889°60904| 29643-43991 
18395854967 | 73394°60360| 41378-15038 
537373°87 106 | 123089742788 | 60189-03145 
244764'82983| 92941-09117 
796873°64937 | 158003-73264 


324150°69600 


1217690°26020 


116774°84051 
201460°19097 
428250°86517 
1965948 °20210 


13 
6°16894 560360 
7°96593 723956 

10°10708 9-22407 
12°68431 1161499 
1576982 14:47888 
19°44700 1789242 
23°81232 21°94400 
28-9749 26°73564 
3507200 32-3852] 
42-24625 39°02907 
50°67518 | 46°82519 
60°56257 55°95689 
72°14637 66°63722 
85°70487 79°11430 
10156439 93°67759 
120°10836| 
141:78858| 1130-47480 
167°13881|  153°57131 
196-79148| 180°50386 
231°49838|  211-92066 
272°15609|  248°58964 
319°83784| 29142344 
375°83325| 341°51036 
441-69836| 400-1531] 
519°31919|  468-91698 
610°99251|  549°69077 
719752993}  644°76391 
848°39240| 756-92502 

1001-86621} 889-58832 

1185-29254| 1046°95774 

1405°37863} 1234-24080 

1670°60729| 1457-93113 

1991-80039! 1726-18369 

2382°89098 2049°32000 

2862°00092| 2440°51504 

3452°96676| 2916-74377 

4187°53643| 3500°10258 

5108-59113! 4219-68376 

6274°97232| 5114-27804 

7768°88564| 623634746 


9707 °56565 
12262°25509 
15690°22476 
20391-20888 
27012°30625 
36656°13405 
51329°91458 
75027 °70957 


765798899 
9480°10426 
11846°90000 
14969 °55326 
19166°32023 
24933°62294 
33079°04770 
44987 59385 
63202°70610 
92848°88108 
145728°18199 
255463 °29045 


15 


5°04289 
6°55383 
8°38990 
10°60464 
13°25951 
16°42508 
20°18247 


24°62500 | 


29°86015 
36°01182 
43°22304 
51°65914 
6151151 
73°00209 


86°38868 | 3:2 
101°96895 | 3°3 
120°09977 | 3-4 
141°18296| 3°5 
165°70000 | 3°6 
194°21319| 3°7 
227°38490| 3°8 
265 °99719| 3°9 
310°97634| 4°0 
363°42307 | 4°71 
424°64982 | 4°2 
496°22740)| 4°3 
580°04326 | 4°4 
678°37500| 
793°98353 | 4°6 
930°23215| 4°7 
109123937 | 4°8 
1282°07756)| 4°9 
1509°03180| 5:0 
1779°94211| 5°1 
2104°65868 | 5°2 
2495 
296886115 | 
3544°81789 | 5°5 
4250°28332 | 5°6 
5120°52074| 
6202°60000 | 5°8 
7559°91373 | 
9281 °23336 | 6°0 
11488°51177 | 6°1 
14358 °25475 ; 6°2 
18149°09652 | 6°3 | 
23252°05212 | 6-4 
30279°45386 | 6°5 
40232 °52177 | 6°6 
54839°83220 | 6°7 
77304°82418 | 6°8 
114166°67456 | 6°9 
7°0 


565740°80342 | 180792°85424 


is Ce So Co % WW WW 
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7 
1 
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Ratio of the Probable Error of the Mode to the Probable Error of the Mean. 


| 1°8 21°3313 4°2009 2°1183 1°2174 *7269 “4193 *2072 
| 1°9 | 7°7368 | 457°8417 | 12°9077 4°3927 2°1953 | 1°2549 *7466 “4311 
| 20 | 4°3012 | 33°2972 | 169°7287 | 12°4767 4°4553 | 2°2432 1°2838 "7642 
| 21 |3°1616| . 8°4333 | 84°9902| 96°6366 11°7814 | 4°4488 2°2717 1°3064 
2-2 | 2°5970 4°2726 | 14°0362 | 242°9030 65°5132 | 12-0059 4°4041 2°2867 
| 23 | 2°2634 3°1503 6°5493 | 22°8026 | 1167°9192 | 48°8362 10°3622 4°3384 
| 24 | 2°0460 2°6578 4°2238 9°1310 36°8018 a) 38°6385 9°7353 
25 | 1°8957 2°2921 3°1915 5°4143 12°6243 | 61°2427 | 1517°6874 | 31°8420 
26 | 1°7881 2°0583 2°6332 3°8779 6°9579 | 17°4427 | 108°2317 | 434°5838 | 
| 27 | 1°7095 1°8984 2°2922 3°0755 4°7362 | 8°9118 24°2562 | 216°1278 
| 28 | 1°6516 1°7842 2°0664 2°6007 3°6263 | 5°7946 11°4150 | 34°2005 
2:9 | 1°6103 1°7004 1‘9086 2°2942 2°9870 | 4°2963 7°1814] 14°6528 
3-0 | 1°5811 1°6381 1°7942 2°0841 2°5828 | 3°4550 5°1003 8°6834 
3:1 | 1°5622 1°5918 1°7093 1°9340 2°3105 | 2°9332 4°0108 6°0586 
| 3:2 | 1°5520 1°5578 1°6455 1°8238 2°1187 | 2°5871 3°3476 4°6629 
3:3 | 1°5493 1°5337 1°5977 1°7417 1°9797 | 2°3467 2°9140 3°8294 
3: 1°5534 1°5180 1°5625 1°6806 L'S776 | 2°1746 2°6166 3°2922 
3:5 | 1°5637 1°5094 1°5378 1°6359 1°8026 | 270498 2°4061 2°9282 
36 | 1°5799 1°5073 1°5223 1°6051 1°7490 | 1°9595 2°2550 2°6725 
37 | 1°6018 1°5114 1°5161 1°5866 1°7133 1°8963 2°1471 2°4899 
38 | 1°6293 1°5220 1°5178 1°5803 1°6938 | 1°8556 2°0726 2°3622 
39 | 1°6624 1°5397 1°5301 1°5867 1°6902 | 1°8351 2°0257 2°2740 
40 1°5665 1°5549 1°6080 1°7033 | 1°8343 2°0032 2°2188 
| 1°7464 1°6062 1°5967 1°6477 1°7358 | 1°8541 2°0041 2°1924 
| 1°7978 1°6667 1°6633 1°7120 1°7917 | 1°8969 2°0289 2°1931 
| 1°8561 1°7653 1°7687 1°8106 1°8777 1°9669 2°0798 2°2207 
1°9221 1°9436 1°9381 1°9588 2°0034 | 2°0704 2°1558 2°2770 
45 | 179965 2°3211 2°2203 2°1819 2°1838 | 2°2167 2°2820 2°3652 
46 — 3°4126 2°7193 2°5226 2°4412 | 2°4189 2°4376 2°4905 
4? _— — 3°7109 3°0601 2°8113 | 2°6967 2°6538 2°6602 
48 — 6°4341 3°9632 3°3544 | 3°0796 2°9428 2°8847 
49 5°7005 41910} 3°6148 3°3289 3°1786 
50 10°6664 5°6242 | 4°3834 3°8501 3°5624 
51 8°1895 | 5:°5415 4°5669 4:0669 
5:2 16°5324 | 7°4444 5°5857 4°7390 
— — — 24°9391 9°6439 6°9532 
55 -- — 14°7870 8°9040 
56 -- — 38°3578 | 12°1682 
58 — - — 20°9345 
| 5:9 — — — — — 


= 

= 
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TABLE IIl—(continued). 
Ratio of the Probable Error of the Mode to the Probable Error of the Mean. 


8 9 1:0 it 1°2 1°3 15 
13238 “7927 “4608 0 22 
2°2924 1°3375 *8044 “4693 "2366 0 23 
4-2616 2°2914 1°3481 *8148 2417 0 24 
9°2167 4°1795 2°2856 1°3563 8241 “4846 "2466 0 25 
27-3963 8°6715 4-0960 2°2766 1°3627 "8324 “4916 "2514 | 2°6 
214-2686 23°4630 8°2243 4°0130 2°2652 1°3675 *8399 4982 | 2-7 


536-8306 | 132°4970] 20°7275 7°8248 3°9320 2°2521 1°3711 8468 | 28 
49°3159 | 2365°9521 | 92°5373 18°5702 74670 3°8536 2°2381 13737 | 2:9 
18-8973 73°5471 69°7007 | 16°6666 7°1439 3°7785 2°2233 | 3-0 
10-6407 24°5570 | 115°3008 | 2797°9031 | 55°2575 | 15°3999 6°8551 3°7067 | 3-1 

771985 13°0637 | 32°2652 195°0629 | 754°5445 | 45-4504 14°2059 65923 | 3-2 
5*4232 8°5553 | 16°0888 43°0412 | 373°6907 | 360°1882 38°4326 | 13°1965 | 3-3 
4°3841 6°3070 | 10°1749 19-9059 | 58°6048 | 902°3059 | 216°7873 | 33°2041 | 3-4 
3°7238 5°0188 7°3339 12°1171 | 24°7849 | 82-0317 | 3859°0983 | 147-8862 | 3-5 
3°2808 4°2123 5°7431 8°5285 | 1474603] 31°1179} 119°2455 36 
2°9731 3°6767 4°7652 6°5690 9°9220 | 17°3081 39°4906 | 182°8220 | 3-7 
2°7556 3°3080 4°1186 5°3807 T5111 | 11°5534 20°7994 | 50°8071 | 38 
2-6024 3°0487 3°6782 4°6092 6°0748 8°5876 13°4722 | 25°1232 | 3-9 
2°4975 2°8664 3°3701 4°0854 5°1532 6°8542 9°8206 | 15°7414] 
2°4316 2°7414 3°1538 3°6557 4°5324 5°7559 7°7303 | 11°2373 | 4:1 
2°3991 2°6619 3°0049 3°4647 4°1019 5°0222 6°4239 87171 | 42 Bo. 


2°3972 2°6207 2°9087 3°2876 3°7999 4°5157 55588 7°1650 | 4:3 
2°4252 2°6139 2°8560 3°1713 3°5899 4°1606 4°9647 671470 | 4-4 
2°4841 2°6398 2°8417 3°1043 3°4497 3°9126 4°5484 5°4516 | 4°5 : 
2°5767 2°6988 2°8630 3°0798 3°3655 3°7440 4°2566 49654 | 4°6 
2°7072 2°7928 2°9196 3°0943 3°3282 3°6396 4°0565 4°6235 | 
2°8822 2°9256 3°0127 3°1462 3°3334 3°5871 3°9275 4°3865 | 4°8 
3°1106 3°1026 3°1451 3°2360 3°3784 3°5806 3°8567 4:2298 | 4°9 oe 
3°4051 3°3318 3°3217 3°3659 3°4629 3°6166 3°8362 4°1379 | 5°0 
3°7831 3°6239 3°5490 3°5398 3°5883 3°6938 3°8611 4:1008 | 5-1 
4-2700 3°9939 3°8366 3°7633 3°7576 3°8129 3°9289 4:1125 | 5:2 
4-9030 4°4629 4°1971 4°0446 3°9757 3°9760 -4°0402 41696 | 5:3 
5-7402 5°0710 4°6485 4°3947 4°2495 4°1872 4°1957 42711) 54 
6°8793 5°8339 5°2155 4°8287 4°5884 4°4523 4°3988 44175 | 
8-4860 6°8516 5°9343 5*3673 5°0054 4°7796 4°6543 46113 | 56 
10-9159 8°2317 6°6993 60399 5°5178 5°1798 4°9694 4°8565 | 5°7 
15-0461 10°1883 8:0738 6°8654 6°1501 5°6681 5°3533 5°1589 | 5°8 
24-2232 13°1659 9°7223 7°9782 69362 6°2646 5°8189 55274 | 5-9 
18°3208 | 12°0456 9°4138 7°9259 69971 63809 59709 | 6:0 
30°4688 | 15°6688 11°3534 9°1941 7°9053 70692 6°5057 | 6-1 
— 220403 1471326 | 10°8607 9°0461 79092 71508 | 6:2 
38°1802 18°4461 | 13°1310] 10°5089 8°9484 79325 | 6:3 
26°2704 | 16°3951 | 12°4328 10°2534 8°8874 | 64 
— — — 47°9585 | 21°5227| 15°0620 11°9257 | 10°0672 | 6°5 
— — 31°0936 | 18°8677 14°1298 11°5483 | 6°6 
60°8817 | 24°9280 17°1544 | 13°4480 | 6°7 


— _ 36°6168 21°5651 | 15°9472 | 6-8 
79°1531 28°6967 | 19°4168 | 6-9 
42°9814 | 24°5042 | 7 
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its standard deviation is at least an element of importance. It is possible, of 
course, that the distribution of the mode may be skew. The arithmetical work 
has been extremely laborious, but will have been repaid, if it enables statistical 


workers to obtain rapidly an important measure of accuracy by a minute or two of 
additional computation. 


CHAPTER III. GrapHic REPRESENTATION OF THE RATIO OF THE PROBABLE 
ERROR OF THE MODE TO THE PROBABLE ERROR OF THE MEAN BY A SURFACE. 


(1) Contour Lines of the Surface of the Probable Error of the Mode. It will be 
seen from Table II that the standard deviation of the mode is usually greater than 
that of the mean and therefore the mode cannot in general be determined with 
the same accuracy as the mean; within however a large range of values of 8,, Bz, 
such as most frequently occur in statistical practice, it can be found with adequate 
accuracy, if the size of the sample be reasonably large. For small samples, not 
only would our method of approaching the problem be illegitimate, but the mode 
would have no practical value at all. 


It will be noted, however, that in a part of the U-curve area the mode, in this 
case an “antimode,” can be determined with greater accuracy than the mean, and 
it is easy to see that this must physically be so. 

In order to get some idea of the nature of the distribution of accuracy of 
determination of the mode, we may denote the ratio of the standard deviation of 
the mode to that of the mean by z, and consider the geometry of the surface : 

=f 
which our Table II represents. 

It is clear that there are two asymptotic planes: the first, 58, —68,—9 = 0, 
is the vertical plane through the rectangular line, and the other will arise 
when 8,=2 or 88,—158,—36=0. In actual distributions, of course, such a 
value of 8, cannot occur, but when we fit a Pearson curve by the first four 
moments 8,= is possible. The surface asymptotes on both sides to the first 
plane, and it is sufficient to limit it by the second plane for practical purposes, 
as values of 8, and 8, beyond this are very rare. In order to obtain the contour 
lines of the surface a double series of sections were drawn: ie. those for B, 
constant and those for 8, constant. Additional points on these sections were 
obtained, by interpolation, where needful. From these sections the points corre- 
sponding to a constant value of z were read off and plotted to obtain a given 
contour. In this manner Diagram I was obtained. 


(2) Minimal Lines. Considerable attention has been devoted to the points 
on both sections where the series of tangents to the curves are horizontal, i.e. to 
the points on the curves where z is a minimum. The locus of these points on the 
contour diagram may be termed a “minimal line.” The minimal line for the 


8,-sections may be termed the 8,-minimal line and that for the £,-sections the 
8.-minimal line. 
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For determining with adequate accuracy the position of the minimum point 
of any section, the six-point method in most cases, but in a few the five or four- 
point method, as given in Tracts for Computers, No. II, p. 36, was adopted. <A 
brief description of the method is as follows: 

Let the ordinates at six equidistant points a, #2, 7, #,,#; and a, be 2%, 2, 2s, 
2,, 2, and 2, respectively, the origin being mid-way between a, and a,. 

The abscissa @ and the ordinate Z of the minimum point can be found by the 
following equations : 


B — 2 — 2 (2, + — 25} + — 2, — 6 (2, — 2) — 6 (2, —2;) — (25 — 25)} 


+ {— (Ze — + 98 (24 — 25) — (2 — 25)} (31). 
Writing this equation ax + bx+c=0, 
we have : + 24) — Ged t + (32). 


The §,-minimal line is fairly straightforward and the minimum values of z 
and the corresponding values of 8, being obtained, the minimal curve can be plotted: 
see Diagram II. It was clear that this curve could be represented fairly well by a 
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parabola of the sixth degree. Such a parabola was determined by the method of 
least squares and its equation was 
y = 4202293 + 05608703 + 0005887 2242* — 0001020906 2* 

+ 000006487695 + -0000005543824.2° — 
where y is the calculated value of 8, and # = 20 (8, —°75). Table III compares the 
values of 8, found from this curve with the plotted values as obtained by com- 
putation from Table I. 

TABLE III. 


Minimum Calculated 

Bi | value z Bs by (33) 

0 -15 15492 | 3°2884 3°2890 

‘1 —13 1°5071 | 3°5833 3°5830 

| 3 —11 1°5157 | 3°7318 3°7272 
9 1°5803 | 38019 3°8082 
*4 | 16896 3°8729 3°8751 

= 1°8323 | 3°9541 3°9510 

6 3 20007 | 4:0462 4°0424 

| 271894 4:1476 4:1469 

8 1 2°3944 4:2562 42589 
9 3 | 2°6125 43707 4°3737 
1-0 5 2°8414 44892 4°4898 
11 7 3:0796 46119 4°6089 
1-2 9 3°3254 4°7371 47343 
13 | 11 3°5779 4:8647 | 4:8679 
1-4 | 13 3°8355 5:0043 | 5:0043 
1°5 | 15 4:0994 5°1249 | 571245 


The accordance is within about 1 per cent. and this may be considered reason- 
able for present purposes. The 8,-minimal line is less simple, because for the 
portion of the surface dealt with in our diagram it consists of two separate parts, 
each of which has been fitted by an independent parabola of the third degree. The 
results are exhibited in Tables IV and V. The intersection of the §,-minimal 
and the §,-minimal curves gives the “bottom of the basin” in the solid surface, 
and this point has been determined as 


2=1:505, at B,=°134, 6,=3°64. 


This point corresponds on the (8,, 8.) diagram* to a curve of Pearson’s 
Type IV, a skew curve, and not, as might @ priori have been supposed, to a 
normal curve. We see accordingly that for points in the “basin” the mode as 
determined by the method of moments is always 50 per cent. more inaccurate 
than the mean, and usually much more than this. Still the mean can be so 
accurate for large samples that we may learn something from determining the 
mode even when its probable error is a considerable multiple of that of the mean. 


The equation of the parabola fitted to one part of the 8.-minimal line is 
y = 153384 + °00831818a — ‘001950782? — 00004203768a* ...(34). 


* See Tables for Statisticians, p. 66, 
Biometrika 19 


. 
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TABLE IV. 
Minimum | y calculated 

By “ value z | Bi | by (34) 
3°3 -5 1°5298 | 06848 ‘06828 
3°4 -4 1°5177 | ‘09166 | °09159 
3°5 -3 1°5089 | "11141 | °11201 
3°6 1°5039 12936 | “13078 
3°7 -1 1°5041 °14279 | "14316 
3°8 0 15099 | | *15338 
3°9 1 15241 | *16023 | “15971 
2 15495 | ‘16207 | ‘16039 
4°1 3 15906 °15927 "15965 
4°2 4 1°6554 ‘15181 *15275 

| 4:3 5 | 1°7598 ‘14161 | 14095 | 


y=computed value of 8;. 
a= 10 (B.—3°8). 


It will be seen that the deviation of 8, from its computed value is as a rule 
less than 1 per cent. 
The equation of the parabola fitted to the other part of the 8,-minimal line is 


y =°9205583 + 0516564 — -00100021 a? + 0000422167 a> ...... (35). 
TABLE V. 
| Minimum | y calculated 

value z Bi by (35) 

44 | -11 19368 | -17386 17512 
45 | -9 2°1787 ‘34301 34386 
46 7 2°4189 | 49925 49547 
47 | 2°6512 | 63419 63199 
46 2°8774 T5425 “75545 
49 |-1 3-0995 86453 “86786 
50 1 3°3189 96775 97126 
3 | 35362 | 1:07096 1:06767 
5 | 37524 1°15870 115911 
7 | 39678 1°24915 1:24762 
9 | 4°1825 1°33688 1°33523 
11 4°3971 1°42205 1°42395 

| 


y=computed value of 
10 (8,— 4°95). 

(3) Model of the Surface. The surface was modelled by cutting out the 
contours and building them up at the appropriate values of z. It is seen to be of 
a very striking and suggestive kind. Photographs of it are provided in Plates 
I and II. 

The reader will find it desirable to compare the contour diagram and the 
surface. He will then notice that the surface consists essentially of a first portion 
which we have termed the “basin.” This, after a fairly flat portion, rises abruptly 
to the two asymptotic planes. The asymptotic plane to the right of the figures in 
Plate I, but to the left in Plate IJ, is that which bounds the area where the 


bd 
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moments of Type IV curve may become infinite and which is marked on the 
(8, 82) diagram as the heterotypic area. The boundary of the “basin,” on the 
left of the photographs on Plate I and on the right of the photographs on Plate II, 
is the vertical plane through the “rectangular line” and beyond this we have 
another portion of the surface of a scroll-like form which sinks below the bottom 
of the “basin” and covers the types of curves which may be found above the 
“rectangular line,” including the U- and some J-curves. The portion of the 
surface shown is limited by three considerations: (1) all real values are limited by 
8, =0, hence the surface is not shown below this plane; again (2) the tables have 
been limited by 8,=1°5. which cuts off the back part of the “basin” as shown in 
Plate I; unlike the front part of the “basin,” the value here of the Probable Error 
of the Mode would be real but some limit had to be given to the tables and the 
model; (8) a similar limit to the tables is given by 8,=7:0. On the “scroll” 
side a limitation has been made at 8,=1°8, although it would have been more 
proper to have carried it to 8, = 1, and limited it by the straight line 8, — 8, -1=0. 
This is the contour line marked 0 in Diagram I of which however only a portion 
is shown. The contour lines on this portion of the scroll are practically straight 
lines and for the purposes of the model it was desirable to limit the size. (4) For 
values lying between §,=1 and 1°8, and the corresponding value of f, falling 
below the line 8,—8,—1=0, the contour lines of Diagram I are so approxi- 
mately straight that for practical purposes the value of the probable error may be 
found by producing them. 


IV. APPLICATIONS. 

I propose now to consider by a few examples the application of the above 
results to determine the probable error of the mode. I shall use several cases of 
skew distribution, which have already been adopted in earlier researches to illus- 
trate various statistical points. By hatched spaces on the diagrams I shall 
indicate the relative significance of the probable errors of mode and mean, 


Example I. Barometric Height at Laudale (4018 observations)*. 


eeiht ic | 27:9 | 28-0 | 28:1 | 28-2 | 28:3 | 28:4 | 28:5 | 28-6 | 28-7 | 28:8 
Frequency 1 1 0 0 5 2°5 2 45 | 85 12 
28:9 | 29:0 | 29-1 | 29:2 | | 29-4 | 29°5 | 29°6 | 29-7 | 29-8 


Frequency | 32°5 | 47°5 | 72°0 | 87°5 | 125°5 ] 187°5 | 255°5 | 304 | 346°5| 394 


Height 29°9 | 30°0 | 30°71 30°2 | 30°3 | 30°4 30°5 | 80°6 | 30°7 30°8 


Frequency | 380°5| 400 is 376 | 266°5| 182 | 94 | 505] 21 2 


* Phil. Trans. Vol. 190 (1897), pp. 423—469. 
19—2 
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The moments were calculated using Sheppard’s Corrections. We find: 
Mean= 29°85699, B, = 2039448, 


= 1478622, 8, = 3'200531, 
=— 25°67687, o = ‘3845285. 
= 699°7395, 
The distance between the mean and mode: 
d = 093161, 
and Mode= 29°95015, 
K =- ‘7629264, so Type I curve was used, 
= 5683636, = 458:09798, 
b = 77°65624, 
m = 4611299, m,= 8'72339, 
ay = 65°30266, = 12'°35358, 
yo 421-22. 
The equation to the curve is 
46°1130 \8'7234 


In finding the ratio of the probable error of the mode to the probable error of 
the mean the interpolation was made from Table II, using the Mid-Panel Central 
Difference Formula in Tracts for Computers, No. III, p. 7. Let the interpolate 
divide the square formed by the four nearest interpolants in the a-ratio 0 
= 1-8), and the y-ratio y = 1— y), then 

20, x = 2,0 + + Ox2;, 1 
— {1 + + 20,1) + + 8) (Wd? + 
— + (G8" + 08" 21,0) + (1 + x) + 08" 2,,,)}  ...(36). 


33 BY 
15578 1°5337 
Z-1,0 21,1 
‘2 1°7093 1°6455 15977 15625 
3 1:9340 1°8238 17417 1:6806 
21,0 41,1 44,2 
‘4 2°1187 1:9797 
22,0 


6 = 039448, = '960552, 
x = 005312, = ‘994688. 
= 0906, = °0160, 
82,,='1166, = "0281, 
= 0800, z,,, = 0126, 
= 0940, = 0210. 


j i 
{ 
| 
2 
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Whence from equation above 2, , = 1°6503. 
Probable error of mean = °00410, 
mode = ‘00675. 
Mean = 29°8570 + ‘0041, 
Mode = 29°9502 + ‘0068. 


In the accompanying Diagram III the ranges of likely variation of mean and 


mode are represented by belts of 24 times the probable error plotted on either 
side of the mean and mode. 


BAROMETRIG HEIGHT AT LAUDALE 
500 


Belt = Mode t 2-5 P.E. = 29-9502 2-5x -00675 
4 = 29-933 jo 29-967 


Belt = Meare + 2-5 PE. = 29-8570 t 2-5x-004140 
= 29-847 to 29-867 


Modal Ordinate = 421-22 


FREQUENCY 
I 
Mean 


100 


28-0 28:5 30-0 30:5 
BAROMETRIC HEIGHT 


Diagram III. 
Example II. Glands of the Fore-Legs of Swine. Right Leg ¢. 
In the Proceedings of the American Academy of Arts and Science, Vol. XXxi1. 
p- 87, 1896, C. B. Davenport and C. Bullard treated the variation in number 
of the Miillerian glands in the fore-legs of 4000 swine, and Professor Karl Pearson 
discussed their material in a memoir in the Phil. Trans. Vol. 191 A, pp. 289—296. 
The following table gives the distribution for 2000 sows: 


Glands | 4 6 8 | 9 10 | 
7°5 | 104-5 | 1825] 241 | 207 |1385| 67 | 36 | 11 | 4 | 1 | 


{ 
| 

| 

| 
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In computing the moments no correction was made for grouping because this 
is a case of frequencies concentrated at mid-ordinates. We have 


Mean = 3°50100, B,= °2591775, 
= 2°824999, B.= 3110825, 
fs = 2417275, « =— ‘37811. 


= 2482631, 
The frequency distribution is of Type I: 


 =19°98608, e = 72°72580, 
m, = 3°78391, M, = 14°20217, 
a, = 379632, ad, = 1424873, 
b  =18:04505, yy = 237-2621, 
d = 
Mode= 2797802. 

The equation to the curve is 
\3°7889 \ 142028 
y=237262(1+ 


By an interpolation from my Table II the ratio of the probable errors was 
found to be 1°8155. 


/ 2824999 
Probable error of mean = ‘67449 a 


= ‘02535, 
Probable error of mode = ‘02535 x 1°8155 
= 04602. 
Hence Mean = 3°5010 + ‘0253, 


Mode = 2°9780 + 0460. 


The result is indicated in Diagram IV. The modal and mean belts or zones 
are clearly separated and the modal belt sufficiently narrow to be of value. 


Example III. Professor Weldon’s Crab Measurements No. 4*. 


We will now turn to a nearly symmetrical curve given by the following 
series of 999 observations : 


| Index | 1 | 2 3 4 5 | 6 7 8 | 9 10 
| Frequency | 1 | 3 | 5 | 11 | 40 | 55 | 98 | 1a1 | 152 | 147 | 
| Index | 11 | 12 | 1 | 14 | 1 | 16 | a7 | 18 | 19 | 20 

| Frequency | 126 | 82 | 72 | 41 | 98 | 8 | 7 | o | o | 2 | 


* Phil. Trans. Vol. 185 A (1894), p. 96. 
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Moments were computed applying Sheppard’s Adjustments. I found 
= 7425852, = 029370, 


= 3467966, = 3°137155, 
= 172-9930, « = ‘119199. 
Type IV curve fits best. 
r = 67°92055, m = 3496028, 
=— 2498612, = 22°15894, 
d= 2201479, Origin= 1533014. 
Mean = 9°684685, Mode = 9°464537. 
 1°76447, o= 272504. 
‘aie MULLERIAN GLANDS IN THE FORE LEGS OF SWINE (oavenporr & BULLARD) 
Belt = Mean 25 PE, = 3-5010 t 2-5 x 
200 = 3-438 to 3-564 
Belt = Mode 2-5 PE. = 2-9780* -0460 
= 2°863 to 3-093 
Modal Ordinate = 237-26 
150 
» 
4H 
100 + 
50 
T 


NUMBER OF GLANDS 


Diagram IV. 
The ratio of probable errors was calculated to be 1°5623. 
Probable error of mean = ‘0582, 
» mode = 0909. 
Mean = 9°6847 + ‘0582, 
Mode = 9°4645 + ‘0909. 


e 
ri) 4 2 3 4 5 6 7 8 9 10 ed 
| 
i 
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The equation to the curve is 


—34-9603 94.996) tan-1 


1580 
y = 176447 (a + 


This curve was selected because it is very nearly symmetrical, and illustrates 
a case in which the “modal belt” (see Diagram V) actually overlaps a portion 
of the “mean belt,” and thus we cannot, without further investigation, assert that 
mean and mode are not identical. 


MEASUREMENTS OF THE CARAPACE IN NAPLES GRABS (wexpor) 


Belt = Mena t 25 PE. = 96847 2-5 x-0582 
= 9-539 to 9-830 
Bolt = Wfode 2:5 PE. = 9-4645 2-5 x-0909 
= 9-237 to 9-692 
Modal Ordinate = 140-02 


FREQUENCY 
8 


Mode 


20} Z 
Z 


o 1 2 3 + 5 6 7 8 9 10 12 13 4% iy 16 7 18 19 20 
SCALE: UNIT = -004 OF BODY LENGTH 


Diagram V. 


Example IV. Distribution of 8689 cases of Enteric Fever received into the 
Metropolitan Asylums Board Fever H. ospitals, 1871—93. 


Number of Number of 
Age cases Age cases 

| Under 5 266 35—40 299 

| 5—10 1143 40-45 163 

| 10—15 2019 45—50 98 
15—20 1955 50—55 40 

| 20—25 1319 55—60 14 

| 25—30 857 Above 60 13 
30—35 503 

| 
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I adopted Professor Karl Pearson’s redistribution of the tail* dividing the group 
“ Above 60” as follows: 

60—65, 8; 65—70, 4; 70—75, 1. 

The above frequency distribution has high contact at the old age end but not 
at the childhood end, thus abruptness corrections are necessary for the calculation 
of the moments, and for that purpose it becomes desirable to know the sub- 
divided distribution of the group “ Under 45” into, say, 5 yearly groups. 

No data being available for the sub-division for the years 1871—93, the age 
incidence in the first five years of life provided by the Reports of the Medical 
Officer of Health for the County of London, for the years 1909—14, which pro- 


vided 253 cases, were taken, and formed suitable material for adjusting the 266 
cases in the above data. They run: 


Age ... 1—2 . 2-3 
Number 14 21 48 78 92 
Sub-dividing the group of 266 under 5 proportionally to the above we get 
Age ... 1—2 2—3 3—4 4—5 
Number 15 22 50 82 97 


INCIDENCE OF ENTERIC FEVER AT VARIOUS AGES FROM 1871 — 1893 


Belt = Meant 2-5PE. = 18-9680 2-5 *-0707 
= 18-791 fo 19°145 


Belt = Mode t 2°5PRE. = 13-3884 * 2-5x-2110 
= 12°B61 fo 13-916 
1,500 } ModN Ordinale = 2003°5 


1,000 


500 N 


NUMBER OF CASES 


Mode 
Mean 


T T 
10 20 30 50 60 
AGE IN YEARS 


Diagram VI. 


The Moments were found, using abruptness corrections} and taking five years 
as the unit, 


= 3'821776, B,= 1033412, 
= 7595118, B.= 4348825, 
= 6351882, = — 2424407. 


* See Phil. Trans. Vol. 186 A (1895), pp. 390—392. 
+ Biometrika, Vol. xu. pp. 231—258. 
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Type I must be used. 


= 3450820, 
b =43:11413, 
m,= 1°736247, 
a, = 2°302704, 
d = 1:115930, 


Mean = 18°9680 years, 
The equation to the curve is 


y = 2003'53 (1 + 


hood of modal variations. 


Example V. Cloudiness at Breslau, 1876—85 (3653 observations)*. 


We will now take an illustration which indicates how an “antimode” can be 
more precise than a mean. The following data provide a U-curve. 


Degrees of Cloudiness at Breslau. 


x 
23027 
The ratio of probable errors was found to be 2°9825. 
Probable error of mean = °014146 in working units, 
= ‘070728 years, 
Probable error of mode = ‘21095 years. 
Mean = 18°9680 + :0707, 
Mode = 13°3884 + ‘2109. 
The accompanying Diagram VI gives the “likely” belts of variation. It will 


be seen that mode and mean are quite distinct, and if the modal belt be larger 
than the mean belt, it still represents quite valuable information as to the likeli- 


e = 108°69359, 


Yo = 2003°526. 
Mode = 13°3884 years. 


Degree ... 0 1 


3 


9 


Frequency (days) | 751 | 179 


107 


69 


117 


From the above data the following values result : 


18°29987, 
6120297, 
583:1839, 
Type I is the suitable curve. 
r= "1795807, 
b = 9981426, 
m=— ‘8774224, 


* Proc. R. S. London, Vol. xu. pp. 287—290. Data taken from Hugo Meyer’s Anleitung zur Bear- 
beitung meteorologischer Beobachtungen fiir die Klimatologie, Berlin, 1891, 8. 108. 


o = 1954936, 
m,= 30°77195, 
= 40°81143, 
= m | | 9 | 21 | 71 | 194 | mm | 2089. 
Bi= 6112252, 
Bo= 1741445, 
=— 1538507. 
3 = ‘0069873, 
o = 4277834, 
m,=— ‘9429969. 
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Both m, and m, being negative, the curve is of U-shape. 
a, =4810939, a, = 5°170487, 
d = 2°002148, Yo = 50°74810, 
Mean = 6°829181, Antimode = 4827034. 
Hence the equation to the curve is 


— "8774 —"9430 
y =50°7481 (1 53105) ) 


48109 ~ 51705 
GLOUDINESS AT BRESLAU, 3,653 DAYS 
5,000 
Belt = Mear + 2-5PE.= 6-8292 2-5 x -04774 
= 6-7099 to 6°9486 
Beli= Mode + 2-5RE=4-8270 2-5 x -01240 
=4-7960 to 4-8580 
Modal ordinate = 50-748 
4,000 # 
3,000 
(=) 
5 
= 2,000 } 
1,000 + Z 
Z 
3 
® 


Oo 1 2 3 4 5 6 . 8 9 10 
DEGREES OF CLOUDINESS 


Diagram VII. 


+ 4 
| 
| 
i. 
| 
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The ratio of the probable errors was calculated to be *25974. 


Probable error of mean = 04774, 
»  antimode = ‘01240. 
Mean = 6°8292 + °0477, 


Antimode = 4°8270 + 0124. 


It is clear that the standard deviation of the antimode is only about a quarter 
of that of the mean, and the ratio in this case corresponds to a value of z on the 
“scroll” part of the surface (Plate I, A), lying below the bottom of the “ basin.” 


The above give illustrations of the error that is likely to occur in the position 
of the mode as determined from the four-moment formula. On the corresponding 
diagrams belts are indicated with ranges of 5 times the probable error round the 
mean and the mode. It will be seen that while the belt about the mean is very 
much smaller in most cases than the belt about the mode, yet the latter is 
sufficiently narrow to show that the mode, as determined by the four-moment 
formula, has practical significance. In every case such displacements as are 
suggested as probable by the modal belt are not such as would upset any reasoning 
which is likely to be based in practice on the observed position of the mode. 
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e portion of the surface lying outside the asymptotic 
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A shows th 


plane through the “ rectangular line.” 


Photographs of the surface which gives the Probable Error of the mode, 


B and C show the “basin.” 
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Kazutaro Yasukawa, Skew Frequency Distributions 
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D and E. Photographs of the vertical aspect of the surface, showing the 
“basin” from above and the two asymptotic planes, 
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UEBER DIE ANWENDUNG DER DIFFERENZENMETHODE 
(“ VARIATE DIFFERENCE METHOD”) BEI REIHENAUS- 
GLEICHUNGEN, STABILITATSUNTERSUCHUNGEN UND 
KORRELATIONSMESSUNGEN. 


Von OSKAR ANDERSON, Prof. der Polit. Oekon. und Statistik an der 
Handelshochschule Varna, Bulgarien. 


(1) Bet Studium statistischer Reihen und, insbesondere, bei Korrelations- 
messungen, kommt es nicht selten vor, daB man sich veranlaBt sieht, die betreffen- 
den Reihen in einzelne Komponenten zu zerlegen. So unterscheidet man bei 
Reihen, welche die Evolution der Ernteertriige einer gewissen Landschaft dar- 
stellen,—eine “sikulare,” “evolutorische,” oder iiberhaupt “ glatte ” Komponente, 
die den Einfluf des Fortschrittes landwirtschaftlicher Technik wiedergibt, und eine 
“ unregelmiaBige,” “ oszillierende,” zufiallige” Komponente, die unter dem EinfluB 
des Klimas oder der Witterung eines gewissen Jahres steht. Bei “ Preiskurven ” 
unterscheidet man oftmals eine “saikulare” und eine wellenartige “ Konjunktur-” 
Komponente, ferner “Saison”-Komponente und eine unregelmafige “ restliche” 
Komponente, hervorgerufen durch Zufalligkeiten des Bérsenspiels, Paniken, Kriege, 
us.w. Es ist eine Anzahl verschiedener “ Zerlegungsmethoden” fiir statistische 
Reihen vorgeschlagen und mit gréBerem oder geringerem Erfolge erprobt worden. 
Als eine derselben kann auch die Differenzenmethode (Variate-Difference-Method) 
angesehen werden. Das, was letztere, aiuBerlich, von den iibrigen Verfahren unter- 
scheidet, ist der Umstand, da sie nur bestimmte lineare Funktionen der gesuchten 
“zufilligen ” Reihenkomponente ergeben kann, wihrend erstere die AnmaSung 
haben, die Reihe in ihre “ wahren” Komponenten mehr oder minder genau zu 
zerlegen. Ob mit Recht—soll im weiteren untersucht werden. 


(2) Die bei Zerlegung statistischer Reihen gewéhnlich benutzten Methoden 
laufen letzten Endes darauf hinaus, das eine oder andere hauptsichlich fiir Sterb- 
lichkeitsmessungen ausgearbeitete “'Tafel-Ausgleichungsverfahren” anzuwenden. 
Ein Charakteristikum dieser Verfahren besteht darin, daB sie immer die “glatte ” 
(wellenartige oder evolutorische) Komponente zu erfassen suchen, das heiBt, nur 
deren “ wahrscheinlichsten” oder “vorteilhaftesten” Wert im Bereiche gewis- 
ser Fehlergrenzen bestimmen. Die “restliche” Komponente wird dann durch 
Subtraktion der “ausgeglichenen” von den urspriinglichen Werten der Reihe 
gewonnen. 


Drei Methoden kommen in erster Linie bei solchen “Ausgleichungen” in 
Betracht*. Hrstens, kann man eine hypothetisch angenommene oder (in seltenen 


* Vergl. E, Czuber: Wahrscheinlichkeitsrechnung, 2 Aufi., 1910, u. Band, 8. 171, 
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Fallen) & priori gegebene Funktion der Beobachtungsreihe anzupassen suchen. 
Gewohnlich nimmt man an, daf die “ glatte” Komponente entweder durch eine 
ganze rationale Funktion (eine Gerade oder eine Parabel mehr oder minder hoher 
Ordnung) darstellbar ist, oder aber auch durch eine mehr oder weniger kom- 
plizierte trigonometrische Funktion (Fourier Analyse in ihren verschiedenen 
Formen*). Wird dabei die Zahl der Parameter dieser Funktionen geringer, als 
diejenige der Reihenglieder, genommen, so miissen die “ vorteilhaftesten” Werte 
der Parameter gefunden werden. Gewéhnlich geschieht das durch Anwendung der 
Methode der kleinsten Quadrate oder eines ihrer Derivate : Methode der Momente, 
Methode der Flachen (Cantelli), u.s.w. 


Zweitens, kann man “mechanische Ausgleichungsmethoden” gebrauchen. Diese 
“gehen von der Erwaigung aus, daB die Stérung eines Einzelwertes (das heiBt in 
unserem Falle: der “glatten Komponente ”) in einer Beobachtungsreihe sich auch 
noch in den beiderseits benachbarten Gliedern bemerkbar mache, da8 jedoch eine 
sehr geringe Wahrscheinlichkeit dafiir bestehe, es werde eine lingere Folge von 
Einzelwerten in gleichem Sinne beeinflu8t sein; viel eher sei zu erwarten, daB sich 
durch den Wechsel des Sinnes auch in kiirzeren oder lingeren Abschnitten ein 
Ausgleich im Verlaufe der Erscheinung vollziehe+.” Von Wittstein und Woolhouse 
bis W. F. Sheppard und E. C. Rhodes ist hier eine Anzahl verschiedener Ver- 
fahren vorgeschlagen worden, von denen einige die auszugleichende Reihe durch 
eine Anzahl von ineinandergreifenden Parabeln darzustellen versuchten ; in vielen 
Fallen wurde auch zur Methode der kleinsten Quadrate gegriffen. 


Drittens, werden vielfach graphische Ausgleichungsmethoden geiibt, wobei “die 
zusammengehérigen Wertepaare durch Punkte in einem Koordinatensystem dar- 
gestellt und nun eine Kurve verzeichnet (wird), die dem Zuge der Punkte méglichst 
getreu folgt und ihnen ‘ méglichst nahe’ liegtt.” Da ein solches Verfahren, obwohl 
es in geschickten Handen zu sehr befriedigenden Resultaten fiihren kann, schlie6- 
lich nur auf Vertrauen zur Persénlichkeit des Forschers beruht und auch eine 
wirkliche Bestimmung der méglichen Fehlergrenzen der gefundenen Resultate 
kaum zulai8t, so werden wir es im weiteren unberiicksichtigt lassen. 


(3) Wie oben schon angedeutet wurde, beruhen die gebrauchlichen Zerlegungs- 
methoden statistischer Reihen auf einer erweiterten Auslegung der Ausgleichungs- 
verfahren: es wird namlich stillschweigend angenommen, daB nicht nur die “glatte” 
Komponente (oder die “ glatten” Komponenten) einer Reihe durch die gefundenen 
“ ausgeglichenen” Werte ungefihr darstellbar seien, sondern da auch die Anniihe- 
rungswerte der “restlichen” Komponente durch eine gewohnliche Subtraktion der 
ausgeglichenen von den urspriinglichen Werten gewonnen werden kénnen. Hierbei 
wird das relative Gréfenverhiltniss der “glatten” und “restlichen” Elemente 
zu einander nicht besonders beriicksichtigt, obgleich die letzteren gewohnlich als 
die betrachtlich kleineren gedacht werden. Unseres Wissens, ist bis jetzt die 


* Natiirlich, kaan man trigonometrische Funktionen auch mit ganzen rationalen kombinieren. 
+ Czuber, loc, cit., S. 185. 
t Ebenda, S. 197. 


2 
4 
om 
2 


OskAaR ANDERSON 295 


logische Berechtigung eines solchen Verfahrens von niemanden besonders bean- 


standet worden, obgleich seine Korrektheit, wie wir gleich sehen werden, stark 
bezweifelt werden kénnte. 


Es sei, um die der Ausgleichungstheorie eigentiimliche Ausdrucksweise zu 
gebrauchen, je eine Bestimmung der NV Grifen G,, G.. Gs, ... Gy gemacht worden, 
wobei die Werte us, -.. uy gefunden wurden. Die wahren Fehler dieser 
Bestimmungen sind durch das System (1) gegeben : 


=—uy 


€y=—Uyt+ Gy) 


G soll hier die “ glatte” und (—e) die wahre “ restliche ” Komponente bedeuten. 
_ Auf eine, fiirs erste nicht genauer zu definierende, Weise sind nun die “angenia- 
herten ausgeglichenen” Werte von G,, G., ... Gy bestimmt worden. Es seien das 


die GréBen g,, 92, -.. gy. Dann sind die scheinbaren Fehler \ dieser Grofen durch 
folgendes System ausgedriickt : 


Eliminiert man u aus beiden Gleichungssystemen, so erhilt man unmittelbar : 


=A, 


Damit also die scheinbaren Fehler den uns einzig interessierenden wahren 
Fehlern gleich seien, wie es gewdhnlich angenommen wird, miissen alle G;— g; von 
bisi=N verschwinden. 


Nun sei aber folgendes festgestellt. Von seltenen Ausnahmen abgesehen 
kénnen die “ausgeglichenen ” Werte go, gy nur aus der Reihe w, uy 
berechnet werden, und zwar bestimmen die meisten Ausgleichungsmethoden, mit 
gutem Grund, den “ausgeglichenen” Wert g; als eine lineare Funktion F entweder 
aller (klassische Form der Methode der kleinsten Quadrate) oder nur einiger 
(mechanische Ausgleichung) Werte der beobachteten Reihe u. Nehmen wir an, 
k sei die Anzahl der fiir die Bestimmung eines jeden g; benutzten Glieder der 
Reihe u, so kénnen wir auch abgekiirzt schreiben : g; = F’(w;,,). Hierbei kann &, je 
nach der benutzten Methode, eine beliebige positive ganze Zahl zwischen 1 und V 
bedeuten. Ist nun F'(w;,;) linear, so hat man mit Beriicksichtigung von System (1): 


9: = =F ((G;- &), k] = F (Gi, x) — x). 


‘ 
| 
(1) 
=—h +H) 
€y=Ayvyt+ Gy- 
' 
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Und System (3) verwandelt sich dann in : 


Damit nun ein beliebiges \; dem wahren Fehler ¢; gleich sei, mu zwangslaufig 
bei beliebigem ¢ die Bedingung 


F =F (Gi, Gi 


erfiillt sein. Dieselbe fiihrt aber entweder zu der sinnwidrigen Annahme, da eine 
lineare Funktion der offenbar als mehr oder weniger “ zufillig” angenommenen 
wahren Fehler e durch eine fast identische lineare Funktion der wahren “glatten” 
GréBen G auszudriicken sei; oder aber mu’ man zugeben, daf dieselbe Funktion 
F wu gleicher Zeit fiir die wahren Fehler genau O und fiir die Griéfen G; ebenso 
genau dieselben G; ergibt, was ebenfalls einen nicht weniger sinnwidrigen Zusam- 
menhang zwischen G; und ¢; ergeben wiirde. Folglich, kinnen die Gleichungen des 
Systems (4) allenfalls nur im Durchschnitt, nur in mathematischer Erwartung 
annehmbar sein. Ist die Ausgleichungsformel richtig deduziert und korrekt ange- 
wandt, so kann man noch zugeben, daB allenfalls, bet gréferem e;, die Differenz 
F(G;,%)— G; im Vergleich zu diesem ¢; so klein sei, daB sie praktisch vernach- 
lissigt werden kénnte, fiir die Funktion F (¢;,;) ist aber auch das nicht gegeben. 
Meines Erachtens, ergeben daher die gebriuchlichen Ausgleichungsmethoden im 
besten Falle bloBh €:— F (6,4), ... (ey, x), keinesfalls aber die 
gesuchte Reihe «,€,...¢y*. Und kann auch, wie es nicht selten méglich ist, 
bewiesen werden, daf die mathematische Erwartung von e;—F'(¢;,,) gleich der 
mathematischen Erwartung von e; sei, so folgt daraus noch durchaus nicht, dab 
die mathematischen Erwartungen der Quadrate dieser Gréfen ebenfalls einander 
wenn auch nur ungefihr gleich seien. Und kann sogar letzteres bewiesen werden, 
so ergibt sich hieraus noch keinesfalls, daf die mathematische Erwartung des 
Produktes [e;— F'(¢;,%)][¢;—F(¢,~)] auch nur angenihert der mathematischen 
Erwartung des Produktes ¢;¢; gleich kommen muB. 


Wir gelangen also zu folgendem vielleicht etwas sensationell anmutendem aber 
durchaus logisch zwingendem Schlu8: insofern die iiblichen Zerlegungsmethoden 
statistischer Reihen Funktionen (zum Beispiel, Mittlere Fehler, Momente, Korre- 
lationskoeffizienten, u.s.w.) der bereckneten “restlichen Komponenten” (also schein- 
barer Fehler) \,, direkt fiir entsprechende Funktionen der wahren “ rest- 
lichen Komponenten” (wahren Fehler) &, €,...€y ansahen, sind diese Methoden 
nicht korrekt und kénnen zu Trugschliissen fiihren. Alle mit ihrer Benutzung 

* Die Trennung der Funktion G;—- F(G;,,) von der Funktion « — F'(e;,,) kann, im Allgemeinen, nur bei 
linearen und einigen wenigen nicht linearen Formen von F gelingen. Das von Prof. W. M. Persons in 
seinem ‘‘Correlation of Time Series” (Journ. of the Amer. Stat. Assoc., June 1923, p. 713 und ff.) 
vorgeschlagene Ausgleichungsverfahren ist weder linear, noch liefert es eine fiir Korrektionsberechnungen 


handliche Form von F'(e¢;,;,). Solange die Fehlergrenzen dieser Methode nicht bestimmt sind, diirfte 
sie daher nur als graphischen Ausgleichungsmethoden analog zu betrachten sein. 


PA 
| 
Ay = €y — F(ey, [Gy — F (Gy, 
| 
J | 
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vorgenommenen Untersuchungen sind zu revidieren und an den berechneten Koeffi- 
zienten Korrektionen anzubringen, die im Wesentlichen durch die entsprechende 
Form von F{e;,;,) bestimmt sind. Inwiefern diese Korrektionen die friiher gefun- 
denen Resultate umindern werden, hangt sowohl von der Formel des gebrauchten 
Ausgleichungsverfahrens, als auch, besonders, von den benutzten Funktionen von : 
ab. In einer Reihe von Fallen diirften die friiheren Forschungsergebnisse im 
Wesentlichen bestehen bleiben, in andern ist es dagegen zu befiirchten, daB sie 
eine betrachtliche Modifikation erleiden werden. Besonders bezieht sich letzteres 
auf feinere Untersuchungen, welche ja nur dann logischen Sinn haben kénnen, 


wenn die zufilligen Fehlergrenzen der gefundenen Werte im Verhiiltnis zu diesen 
klein sind. 


(4) Im Rahmen dieser Veréffentlichung vermégen wir, natiirlich, nicht, die 
verschiedenen Ausgleichungsmethoden von unserem Standpunkte aus zu sichten 
und deren Korrektionen zu bestimmen. Die jetzt folgenden zwei Beispiele dienen 
daher nur zum besseren Verstiindnis der bisherigen Ausfiihrungen. 


Auf S. 295—296 von E. T. Whittaker’s und G. Robinson’s The Calculus of 
Observations (London, 1924) finden wir die Sheppard’schen Ausgleichungsformeln 
fiir Parabeln bis zur 5. Ordnung und fiir Anzahl der bei der Ausgleichung benutzten 
Glieder von 3 bis 21 (das heibt, bis n= 10). Nehmen wir an: Ausgleichungs- 
parabel 2°" Ordnung, n = 2. Dann ist, in den Bezeichnungen des zitierten Buches, 
(wu —u'), die Differenz zwischen der beobachteten Gréfe u, und deren nach der 


Sheppard’schen Methode ausgeglichenem Werte u,, durch folgenden Ausdruck 
gegeben : 


Up — Uy = Up — ge {1 Tu) +12 (u, + uy) —3 + 
= [+ 4u_, + 6uy — + = 


wenn das Symbol du, die vierte endliche zentrale Differenz der Gréfe u, bedeutet. 
Wird also angenommen, da8 im Ausdruck fiir den scheinbaren Fehler — A, = uy — ty 
(s. oben, System (4)), kein “ glattes” Element G, mehr vorhanden ist, so bedeutet 
das, daB die Differenz .) — praktisch gleich Null zu nehmen ist und 
folglich, 

No = — F' (eo, 1) = — 


35 


Es erweist sich also hier, daB A, nicht ¢,, sondern einem Bruchteil der negativ- 
genommenen 4" endlichen Differenz der wahren Fehler e gleichzusetzen ist. 


Kin anderes Beispiel. Es liege eine Beobachtungsreihe ... Wy, Vor, 
wobei WV ungerade ist. Ein jedes u; bestehe aus einem “ glatten ” Element G; und 
einem “restlichen” ;, welches durch die Gleichung gegeben sei: »; =—e;=ui—G;. 
Es sei angenommen, da die “ glatte” Reihe G,, G.,... durch eine Gerade, deren 
Konstanten nach der Methode der kleinsten Quadrate bestimmt werden, genau 
auszudriicken sei. Letztere ergebe die Werte: wu’, ws’, ... Uy 


Dann erhalt man, nach den bekannten Regeln der Methode der kleinsten Quad- 
rate, fiir u;—u;, das heiBt fiir die scheinbare “ restliche ” Komponente, schlieBlich 
Biometrika xvu11 20 
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folgenden Ausdruck : 


N 
= ni 


2 2 


Kann die Reihe 7, ... ny als eine Reihe von empirischen Griéfen, die eine 
zufillige Variable mit einem konstanten Verteilungsgesetz bei N von einander 
unabhingigen Versuchen annimmt, angesehen werden, so ist die mathematische 
Erwartung* von u;’ gleich der mathematischen Erwartung von u;, die mathe- 
matische Erwartung von (u;— w;’) also gleich Null. Was aber die mathematische 


Erwartung von ane sae anbetrifft, so ergibt sich fiir diese nicht der Wert pe, 


sondern #2, Wenn die mathematische Erwartung des Quadrates der Ab- 


weichung der GréBSe 7; von ihrer mathematischen Erwartung bedeutet. Kann ferner 
die Komponente G,, G2, ... Gy nicht genau durch eine Gerade ausgedriickt werden 
oder sind die einzelnen 7; nicht vollkommen von einander unabhingig, so werden 
in der mathematischen Erwartung von (wu; alle Produkte »;n; von bis 
(u;—u; 
i=N auftreten. Eventuell kénnte dann die Korrektion des Ausdruckes ‘= V 


dieselbe GréSenordnung, wie er selbst, erlangen. Geht man vollends von der 
Berechnung einer Geraden zur Ausgleichung nach Parabeln héherer Ordnung iiber, 


N 
2 (u; — 
so wird, wenn diese Ordnung n ist, die mathematische Erwartung von VY 


gleich #2*, und die Korrektionsformel fiir (u;—u;’) mit jeder héheren 


Ordnung noch komplizierter. Ihre geringe Handlichkeit und die weiten Grenzen, 


* In der vorliegenden Arbeit bediene ich mich durchwegs der in England bis jetzt wenig populiren 
‘*russischen ” Methode der mathematischen Erwartungen. Einige Hinweise auf die Eigenschaften und 
Vorziige dieser Methode findet der englische Leser im Artikel meines Lehrers Prof. Al, A. Tchouproft’s, 
‘* On the Mathematical Expectation of the Moments of Frequency Distributions” (Biometrika, Vol. x11. 
November 1918, pp. 140—142), eine vollstiindigere Darstellung deren Lehrsiitze aber im 2. Kapitel von 
Prof. L. Bortkiewiez’s Iterationen (Prof. Dr L. v. Bortkiewicz, Die Iterationen. Ein Beitrag zur 
Wahrscheinlichkeitstheorie, Berlin, 1917, S. 30—69). 

Die Beschrinktheit des der vorliegenden Arbeit zugewiesenen Raumes, lift es, selbstverstindlich, 
nicht zu, den Rechenweg, auf welchem die meisten der unten angefiihrten Formeln abgeleitet wurden, 
auch nur anzudeuten, Dies durfte aber geschehen, weil deren Bestimmung, mit den wenigsten Aus- 
nahmen, eben kein Problem, sondern nur eine Rechenaufgabe darstellt, die, bei geniigender Akkuratesse, 
von einem jeden erledigt werden kann, der iiber gewisse Kenntnisse in der Theorie der mathematischen 
Erwartungen sowie in der Kombinatorik verfiigt. 

t Dieser Satz ist eigentlich schon von Gauf in seiner Theoria combinationis, Art. 38, bewiesen 
worden. Vergl. Czuber: Wahrscheinlichkeitsrechnung, 3, Aufi,, 1914, Bd. 1. 8. 347—349. 


4 
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in denen sich deren Grofe bewegen kann, erschweren die Anwendung der Methode 
der kleinsten Quadrate hier auBerordentlich *. 


(5) Fassen wir jetzt dasselbe Problem von einer anderen Seite an. Wir haben 
eben gesehen, daf zwischen der Differenzenmethode und den verschiedenen Reihen- 
Ausgleichungsmethoden insofern kein Unterschied besteht, als wie die eine so 
auch die andern die “ restliche” Komponente nicht unmittelbar bestimmen, son- 
dern nur deren gewisse lineare Funktionen ergeben kénnen. Es liegt daher der 
Gedanke nahe, zu untersuchen, ob die Differenzenmethode nicht als ein regel- 
rechtes Reihen-Ausgleichungsverfahren ausgebaut werden kénne*t. 


Es sei wieder wu, = G, +, Ue= Gz + No, ... Uy = Gy + ny, eine statistische Reihe, 
die aus einem “glatten” Element G und einer “restlichen” Komponente 7 besteht. 
Nehmen wir nun an, daf die Reihe G,, G,,... Gy sich genau durch eine Parabel 
(2k —1)-ter Ordnung darstellen li8t. Dann wird in der Reihe der 2k-ten endlichen 
Differenzen unserer u-Reihe der letzte Rest der Komponente G verschwunden sein 


und, folglich, wenn wir 6* als Symbol fiir die 2k-te endliche zentrale Differenz 
einfiihren, wird 
bu; = + = 


* Wohl der bedeutendste Schiiler Prof. Al. A. Tchouproff’s, Prof. N. S. Tchetverikoff, hat in einer 
Reihe von Aufsitzen (siehe die in Moskau erscheinenden russischen Zeitschriften ‘‘Fragen der Kon- 
junktur,” ‘‘ Statistische Nachrichten,” u.s.w.) die Stabilititsverhiiltnisse der russischen Ernten und die 
Zusammenhiinge zwischen Ernte und Kornpreis einer genauen Untersuchung unterworfen. Es diirfte 
nicht leicht sein, in der statistischen Litteratur anderer Linder gleichwertiges an Scharfsinn, Feinheit 
der angewandten Methoden und an Fleif} zu finden. Und dennoch befiirchte ich sehr, dah die Vorliebe 
Tchetverikoff’s fiir Parabelanpassung nach der Methode der kleinsten Quadrate ihn in einigen Fiilen 
zu Trugschliissen gefiihrt hat und da§f wenigstens einige von seinen ‘‘riitselhaften’’ Korrelatious- 
koeffizienten eben nur “spurious” sind. 

Hoher in dieser Hinsicht steht die Arbeit einer anderen Schiilerin Tchouprofi’s, des leider zu friih 
verstorbenen Friiuleins M. Winogradowa: ‘‘Der Alkoholkonsum in Rufland und die Ernte,” Petrograd, 
1916, erschienen in den Proceedings (Students Section) of the Econom. Departm. of the Petrograd Poly- 
technic Institute (russisch), 

Fri. M. Winogradowa gebrauchte erste Differenzen. 

t{ Die Grundidee der von ‘‘Student,” K. Pearson, E. S. Pearson, E. M. Elderton, B. M. Cave, 
A. Ritchie-Scott, J. Henderson, mir und andern in der Biometrika seit 1914 entwickelten Differenzen- 
methode nehme ich hier als bekannt an. Prof. Truman L. Kelley (Statistical Method, New York, 1923, 
pp. 272, 273) formuliert sie, nach “Student” und K. Pearson, folgendermafen : ‘‘ Given two series, 
#1, XQ, ...X, and yj, Yo, ... Y, between which there is an organic correlation, R, and a spurious corre- 
lation due to a time or location factor such that the two phenomena together result in an apparent, 
i.e. an obtained correlation, of r. The problem is to determine R. “Student” (1914) has shown that if 

X,+ bt, + + + ete. 

Xz + + + + ete. 

etc. 
and if yi= Bt, + + + ete. 

y2= Yo+ Btg+ + + ete. 

ete. 
in which X,, X2, etc., Y;, Yo, ete., are independent of time or location, then, if the parabolic equations 
in t terminate with some power t”, the correlation ry, is given by the correlation between A,, and 6,,, 
the two series of nth order differences, A; standing for the measures (;— 2), (*2— 23) ... 3 
Ao for the measures [(a— 22) — <3) — — and simi- 
larly Ag for third order differences ; Ay for fourth order differences, etc.; the 5’s having comparable 
meanings in the case of the y-series.”’ 


20—2 
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Nehmert wir jetzt an, unser Ziel bestehe darin, eine solche lineare Funktion 
F (uj, x41) 2u finden, daB genau F(G;, G;. Da nun 
— F = Gi — F (Gi, + ni — F (ni, 2x41); 
so fiihrt obige Annahme zwangsliufig zur Gleichung : 


— F =i — F 
wo im rechten Teil die Komponente G giinzlich fehlt. 


Nun haben wir eben gesehen, da6 unter den eingangs gemachten Pedingungen 
gerade die 2k-ten endlichen Differenzen der Reihe wu die von uns hier benétigten 
Eigenschaften besitzen : (1) lineare Funktionen von u zu sein und (2) keinen Rest 
der Komponente G mehr aufzuweisen. Das fiihrt uns zum naheliegenden Gedanken, 
u; — F (uj, x41) gleich Zé*u; zu setzen, wo Z einen Multiplikator bedeutet, dessen 
“vorteilhaftester” Wert noch gefunden werden mu8. Aus der gemachten Annahme 
folgt unmittelbar : 


F (Uy, = — uj. 


Das wire der gesuchte Ausdruck fiir F, den wir auch in folgender Form 
schreiben kénnen : 


Es bleibt nur noch, den “vorteilhaftesten” Wert fiir Z zu bestimmen. In 
Anlehnung an das bei der Methode der kleinsten Quadrate iibliche Verfahren, 
wollen wir Z der Bedingung: EF & [n;— 28” n;]? = minimum, unterwerfen, wo das 
i-1 
Symbol # die mathematische Erwartung bedeutet. Und daraus: 
N N N 
i=1 i=l i=1 


Wenn man die erste Ableitung davon nach Z gleich O setzt, so erhailt man fir 


Z den Ausdruck : 


N 
i=1 


N 
E ni 
t=] 


Z= 


Da die zweite Ableitung positiv ist, so ist der gefundene Wert von Z ein 
Minimum. Um fiir Z eine einfache Form zu gewinnen, miissen wir bei der Kom- 
ponente » gewisse Eigenschaften voraussetzen. Es sei wiederum angenommen, 
daB die Reihe 7, 7,,...y, empirische Gréfen darstellt, welche eine Variable mit 
einem konstanten Verteilungsgesetz bei V von einander unabhiingigen Versuchen 
ergeben hat. Die Form dieses Verteilungsgesetzes bleibe aber beliebig. Dann ist 
En," = En" =... bei beliebigem m und ferner Ln; . wenn 
i+j. Nach einigen Umformungen gelangen wir dann zum Ausdruck : 

ga (2k) 
(4k)! kt 

Derselbe Ausdruck wiirde sich auch ergeben, wenn wir als Bedingungs- 

gleichung fiir Z einfach = minimum, nehmen wiirden. Der von 


4 
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uns eingeschlagene Weg diirfte aber die Aniehnung an die Methode der kleinsten 
Quadrate klarer hervortreten lassen. 


Setzt man den gefundenen Wert von Z in die Formel fiir F (uj,41) ein, so 
erhalt man die endgiltige Formel des der Differenzenmethode entsprechenden 
Ausgleichungsverfahrens : 


(4k)! kl ke! 
Gebraucht man in dieser Formel 0, statt 7, als Subscriptum, und gibt man & 
sukzessive die Werte 1, 2, 3, 4, ... so erhalt man unmittelbar : 
bei k=1, = 4 [u. + + u_)], 
» k=2, ul = + 12(u, (w+ 


» = 5h, [181lu, + 75 (u, + — 30 (ue + + 5 (us + U_s)], 
ete. 


F (Uj, = Ui = 


Also genau dieselben Koeffizienten, die die Sheppard’sche Ausgleichung in dem 
Falle ergibt, wenn man sein n der Ordnung seiner Parabel (oder der Halfte der 
Ordnung unserer Differenz) gleich setzt! (vergl. Whittaker and Robinson, loc. cit. 
pp. 295—296). 

Will man also hier, bei Sheppard, die “restliche Komponente” fiir einen 
Annaherungswert der wahren Komponente 7 ansehen, so darf man der Differen- 
zenmethode auch nicht vorwerfen, daf sie nur lineare Funktionen der Komponente 
ergeben kénne. 

Gibt man, im Gegenteil, zu, daB die nach Sheppard berechnete restliche 
Komponente «;— u;’ nicht gleich y; sei (und wir glauben, daB man gezwungen ist, 
dies zuzugeben), so mu8 man die Differenzenmethode, als ein in gewissen Hin- 
sichten ebenbiirtiges Mitglied, in die Reihe der anderen Ausgleichungsmethoden 
aufnehmen. Meines Erachtens, ist eben das letztere geboten. 


(6) Um die weiteren Ausfiihrungen der vorliegenden Schrift verstindlich zu 
machen, ist es notwendig ihnen eine Darstellung der wichtigsten Formeln der 
Differenzenmethode vorausgehen zu lassen, Einige derselben werden hierbei zum 
ersten Mal der Offentlichkeit iibergeben. 


Eine gegebene statistische Reihe u,, w,... wy bestehe aus einer “glatten” 
Komponente @ und einer “restlichen” s, so daB u;= G;+s;. Es sei ferner ange- 
nommen, da$ durch die k-te endliche Differenz erstere Komponente ginzlich 
vertilgt werde und daf, folglich, A*u; = A*G; + A*s; = A*s;*. Wir besitzen dann 


* Begniigt man sich damit, da§ A*G;, ohne ganz zu verschwinden, nur einen gewissen, fiir die 
Praxis zu vernachlissigenden, Teil von A‘u; ausmachen kénne, so werden, auBer den durch eine ganze 
rationale algebraische Funktion (Parabel) bis (k-1)-ter Ordnung darstellbaren Reihen, auch verschie- 
dene andere, durch andere algebraische oder transzendente Funktionen darstellbare Reihen, ungefihr 


zum selben Resultat fiihren. So auch Yule’s Sinus-Reihen mit der Formel uj=4 sin (2* +7) 
im Intervall O<h<= 47 und T2h> fT. (Vergl. den Artikel von Yule im Journ. Roy. Stat. Soc., Vol. 
LXxxIv. p. 497 et seq.) 


Weiter unten werden wir noch die allgemeinen Bedingungsgleichungen feststellen, welche die Eigen- 
schaften der beim endlichen Differenzieren allmihlich an Bedeutung verlierenden Reihen der ‘‘Gruppe 
G” umschreiben. (Vergl. § 10.) 


| 
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(N —k) lineare Funktionen der NV Gréfen s,, s,... sy, von denen eine jede von 
der Form 

A*s; = 8; — Sinn + C2 Si+2— Sing + (- CF (6 
ist, wenn 


N-k Gleichyugen geniigen aber nicht zur Bestimmung von NV Unbekannten, 
und folglich, kénnen die GriBen s,, s,, ... sy, keinesfalls allein aus der Reihe der 
Funktionen A*s berechnet werden. Diese komplizierten Funktionen kénnen an 
und fiir sich auch durchaus nicht als Reprasentanten oder angeniiherte Werte der 
Reihe s gelten. In dieser Hinsicht haben G, U. Yule und W. M. Person ganz 
rechtt. Sie sollen es aber auch garnicht! Die einzige Bedeutung dieser Funk- 
tionen besteht eben darin, da man aus ihnen, bei Einfiihrung gewisser Hypothesen 
iiber die Beschaffenheit der urspriinglichen Reihe s, verschiedene Charakteristiken 
derselben berechnen kann. Nimmt man an, letztere kinne angesehen werden, als 
eine Reihe von empirischen Grifen, die eine zufdllige Variable mit einem kon- 
stanten Verteilungsgesetz bei N von einander unabhingigen Versuchen annimmt, so 
gestalten sich die Formeln fiir die gesuchten Charakteristiken besonders einfach. 

Als erste dieser Formeln sei der mittlere Fehler (Standard Deviation) ¢ 
bestimmt. 

Bedeutet das Symbol # wiederum die mathematische Erwartung, und gebraucht 
man die Bezeichnungen Prof. Al. A. Tchouproff’st: 


Mm, pee (8— Mm) (7), 
so ist, bekanntlich, der mittlere Fehler (Standard Deviation) : 


o = Vp, =VE(s— (8). 


* In der Theorie der endlichen Differenzen steht gewéhnlich A’u, fiir w2-w, und, iiberhaupt, 
Aku, fiir AF-1 uz,, — A*¥-1u,;.. Das macht wohl die Analogie mit der Differentialrechnung evidenter, die im 
Texte von uns gebrauchte Bezeichnung ist jedoch fiir den praktischen Gebrauch entschieden bequemer, 
da wenn die Reihe in einer vertikalen Kolonne angeordnet ist, man aus dem oberen das ihm unmittelbar 
folgende untere Glied subtrahiert und nicht umgekehrt. Letzteres kénnte eine unniitze Fehlerquelle 
abgeben. Dieselben praktischen Erwiigungen veranlaBten mich auch, vom Gebrauche der sogenannten 
zentralen Differenzen hier abzustehen, obgleich einige Formeln durch Einfiihrung der letzteren ein 
eleganteres Aussehen erhalten hiitten. 

An diesem Orte sei noch besonders auf die unbedingte Notwendigkeit der Kontrollen bei Berechnung 
der endl. Differenzen hingewiesen. Am bequemsten ist die Benutzung der Kontrollformel 


Aks;= Ak-1 81 Af 
i= 
+ Es bedarf garnicht der ziemlich weitschweifigen Ausfiihrungen W. M. Person’s, um die ‘‘theory 
of the tendency of signs of terms of higher differences to alternate” zu beweisen. (Vergl. sein ‘‘ On the 
variate difference correlation method and curve fitting” in Quarterly Publications of the American 
Statistical Association, New Series, No. 118 (Vol. xv.), June 1917, p. 612.) Es leuchtet ohne weiteres 
ein, daS wenn auch die einzelnen s; von einander vollkommen unabhiingig sind, schon die erste 
Differenz s;-8,,; mit der ihr folgenden ersten Differenz s;,,—8;,. negativ korreliert sein muB, und 
da Ats;,,, in welchem k von k+1 -inzelnen s dieselben sind, wie im Ausdruck fiir A*s;, mit diesem 
desto stirker negativ korreliert ist, je gréBer k genommen wird. 
Die Werte dieser Korrelationen sind weiter unten durch Formel (19) gegeben. 
t Loe. cit., p. 148. 
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Die Grofe o* = w, werde ich hier apriorische Streuung (Variance) nennen. 


Dem apriorischen mittleren Fehler o sind bei der oben gemachten Annahme 
die mathematischen Erwartungen zweier Formen des empirischen mittleren Fehlers 
a” gleich, die wir durch ein Subskriptum 0 von einander unterscheiden wollen : 


= (8; 
i=1 


N N 
= = si 
und o = —, wenn Sy) = 
Es ist also dann Eo) = Eo’ 
Fiihrt man die Bezeichnung ein: 
1 = (A* si)? 
i=1 
wo A*s; durch die Formel (6) gegeben ist und C,* (vergl. Form. 6a) gleich ea ; : 


so kann es—immer unter derselben Annahme einer zufalligen Variablen mit 


konstantem Verteilungsgesetz und V unabhingigen Versuchen—ganz strenge be- 
wiesen werden, dab 


Eo;'? = = py. 


Diese Gleichung bleibt hier genau, unbeschadet, ob N—k gleich 1 oder einer 
Million sei*. 


Ist so groB, man fiir seine Faktorial-Funktionen die Stirling’sche 
Formel anwenden darf, so kann (11) auch in folgender Form geschrieben werden : a 


Schon bei k=6 betriigt der Fehler, den man bei Anwendung dieser An- 


niherungsformel begeht, kaum 2,1 %/ des wahren Wertes, und bei k= 9—bloB 3 
14%. 
* Formel (11) dient nur zur Bestimmung der uns einzig interessierenden GréBe »,. Es, hat daher ul 
keinen Sinn, ihr eine Form zu verleihen, die der Formel (10) analog wiire, also etwa 
1 Nok / 1. 


Dadurch wird nur der Rechenweg komplizierter, aber der wahrscheinliche Fehler der Bestimmung nicht 
kleiner. Ebenso wenig Sinn hat es, zur Bestimmung von oc,” nicht gleich alle erhaltbaren Werte von s 
hinzuzuziehen, sondern, um die Zahl der A*s immer konstant bleiben zu lassen, mit jeder neuen 
endlichen Differenz je ein neues Glied s der Reihe hinzuzufiigen. Der Gewinn (Vereinfachung der 
Formeln) ist gering und die mittleren Fehler fiir die ersten Differenzen werden nur unniitzerweise 
vergréBert. Es ist ziemlich dasselbe, als wenn wir bei Anwendung der Methode der kleinsten Quadrate 
einen Teil der méglichen Normalgleichungen unberiicksichtigt liefen. 


N-k 

>> Ak s2 

\ 
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Eine ganz andere Frage ist es, natiirlich, wie weit die empirische Gréfe o;” 
sich in der Praxis von seiner mathematischen Erwartung p, entfernen kénne. 
Zur Lisung dieser Frage ist die Bestimmung des mittleren Fehlers von o,” 
erforderlich, und hier spielt, selbstredend, die absolute Grife von N—k eine 
ausschiaggebende Rolle. 


(92,4, die apriorische Streuung (Variance) von o;”, bestimmt sich aus der 
Forme! 


(62, k) = E (ox? — 2). 
Nach ziemlich langwierigen Umformungen ergibt sich hieraus (immer nur 
N 
unter denselben Annahmen iiber die Natur der Reihe s), wenn k = oe: 


wobei 


k~1 k—2 k-3 
Sw = > +2 (Cy! +3 (C,! (02 CY 
i=0 i=1 i=1 
Da es bewiesen werden kann, daf jedenfalls 
k > 25 x) 


so erhailt man, wenn man fiir gréBere (NV —k) die Ausdriicke von der Ordnung 


unterdriickt, angenihert : 


(e2,k) = N —k + k (13). 
Ist schlieBlich & so groB, daB fiir seine Faktorial-Funktionen die Stirling’sche 
Formel angewandt zu werden vermag, so kann man auch anniherungsweise 
schreiben : “3 
Nok Wok (13a). 


Schon bei k=6 betriigt der Gesamtfehler bei Anwendung dieser Formel 
etwa 3%. 


Formeln (12), (13) und (13a) gelten fiir ein beliebiges konstantes Verteilungs- 
gesetz der Variablen. Ist es jedoch “normal,” so besteht, bekanntlich, die Bezie- 
hung p,=3y,", und in obigen Formeln werden iiberall die ersten Glieder der 
rechten Seite verschwinden. Aus (13a) erhalten wir dann, zum Beispiel : 


2 
k) = 


N-k° 
Denkt man sich, also, k geniigend und N betrichtlich groB und das Vertei- 
lungsgesetz “normal,” so werden die mathematischen Erwartungen der empirischen 


(a2, 


* Ks ist also Siy=1; Siy=10; Sig) =138; S(4)=1940 ; S(g)=28130; S(g)= 414372, u.s.w. 
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Streuungen der GréBen o%42,... sich etwa so zu einander verhalten, 
wie Vk: Vk+1:Vk+2:.... 


In Tabelle I finden sich die bis zur 3°" Dezimalstelle genauen Werte, welche 
Formel (12) fiir die ersten 6 endlichen Differenzen ergibt. 


TABELLE LI. 
Ordnung der | k) fiir ein beliebiges konstantes k) fiir ein ‘‘normales” 
Differenz k Verteilungsgesetz der zufiilligen Verteilungsgesetz der zufil- 
Variablen S ligen Variablen S 
N N 
(¥-1)? (V-1)? 
(NV — 2,556) +(0,889 V— 2,111) po? (3,889 V—9,778) pr? 
| (W—2)? 
k=3 (MV 3,690) (1,620 — 5,790) (4,620 WV — 16,860) 
(NV -3) (V-3) 
(NW 4,792) (2,253 -10,633) yy? | (6,253 W—25,012) 
| 
(4 — 5,887) mat (2,819 W — 16,436) ps? | (5,819 WV ~ 34,004) ps? 
| (V-5)? | (V—-5)? 
k=6 (4 6,971) (3,335 W— 23,096) pa? | (6,835 WV — 44,008) ps? 
| -6) | (V-6) 


Aus Tabelle I erhalten wir die apriorischen Strewungen der GréBen o;*%. Um 
deren mittlere Fehler zu bestimmen, miissen aus den fiir erstere berechneten 
Zahlenwerten Quadratwurzeln gezogen werden. Die Gréfen pg, und pw, sind 
apriorisch und miissen beim Gebrauch der Tabelle in der Praxis durch empirische 
Anniherungswerte ersetzt werden. Dadurch entstehen Schwankungen, welche 
eigentlich ihrerseits mit Hilfe der mittleren Fehler der Streuungen o*(2,,) gemessen 
werden sollten. Die Verfolgung dieses Weges wiirde uns aber hier zu weit fiihren. 


(7) Formel (12) ergibt nur die apriorische Streuung (oder auch den mittleren 
Fehler) eines jeden 0; gegeniiber dem apriorischen u,. Sie besagt noch nichts 
iiber das gegenseitige Verhiltnis der GréBen ... Welche, wie leicht 
ersichtlich, stark positiv miteinander korreliert sein miissen. Ihre mathematischen 
Erwartungen sind wohl alle einander gleich und diejenigen ihrer Differenzen, 
folglich, gleich Null. 

In der Praxis werden jedoch die Differenzen dieser empirischen GréBen von O 
mehr oder weniger betrichtlich abweichen. Um ein MaB dieses Abweichens zu 


erhalten, wollen wir die apriorischen Streuungen der Differenzen o”,4,; — ox? 
bestimmen : 


E — ox?) — OF = E — fe) — (on? — 
= — + (ox? — py)? — 2E (oy? — po) 41 — fe) (14). 
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Die beiden ersten Glieder der rechten Seite ergeben sich direkt aus Formel 


(12), fiir das dritte erhalten wir, wenn k = =: folgenden Ausdruck : 


7 1-38 28" 
2k+2 
Con @*N-%-1 | (15) 
wobei 


k-2 


k+1 k+1 


bel. 28 
und jedenfalls : k-1)~ o (WW —k—1) 


Setzt man die entsprechenden Ausdriicke aus (12) und (15) in (14) ein, so 


erhalt man, wenn man alle GréfSen von der Ordnung Wi unterdriickt, fiir die 
ersten 6 Differenzen folgende Werte (s. Tabelle II). 


TABELLE IIt. 


Formel fiir die apriorische Streuung, bis zur Dieselbe, bis zur 
Ordnung fiir ein beliebiges konstantes Ordnung 
Verteilungsgesetz der zufilligen Variablen S abgerundet 
(ox"— a0") | W(V-1) V(W-1) 
| 0,2778 (4 — (0,2222 N+1,1111) ps? 0,2222 pup” 
(¥=1) (W—2) (W=1) (W-2) W-2 
| 0,2544 (u4— (0,1089 V + 1,0933) po? 0,1089 uo? 
| + (W—2)(W-3) W-3 
0,2096 (44 — (0,0673 V+1,0808) po? 0,0673 
(W—3) (W- 4) (W—3) (W-4) | W-4 
0,1873 (0,0468 V+ 1,0721) 0,0468 
(W=4)(V-5) (W= 4) (W=5) N-5 
E 0,1693 (0,0350 V + 1,0656) 0,0350 po? 
(W—5) (W—5) (W—6) N-6 


Es ergibt sich also, da8, im Allgemeinen, je gréBer die Ordnung der endlichen 
Differenz k, desto geringer die mathematische Erwartung des Quadrates der 
* Es ist also: S,'=1; S,’=15; S,’=242; S,’=3815; S,’=59724; ete. 


+ Formel (15) erlaubt es auch, den Korrelationskoeffizienten zwischen o,’” 


und o”,,, genau zu 
bestimmen. 


+ Fir ein ‘‘normales” Verteilungsgesetz verschwinden alle Glieder, welche y4—3y,” enthalten, und 
die einzelnen Griéfen der Tabelle unterscheiden sich von denjenigen in Formel (23) meiner Monographie 
“Uber ein neues Verfahren bei Anwendung der ‘ Variate-Difference ’-Methode” (Biometrika, Vol. xv., 


1923, p. 145) nur durch den Faktor 


Die Formel (21) fiir o? ist niimlich nicht 
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Differenz o%%4,—o,? ausfallt. Das ist auch ganz natiirlich, denn obgleich ihr 
méglicher minimaler Wert immer derselbe bleibt (— o;,? bei o%%4,=0), ist ihr 
Maximum durch + 2 " 7 gegeben, verringert sich also mit wachsendem k*. Der 


mittlere Fehler der Differenz o”,,,—o,%? bleibt jedoch endlich, solange », und V 
endlich bleiben. 


Ferner ist zu bemerken, daB die Differenz p,—3y,? sich hier blo® von der 


Ordnung erweist. Konnen bei betrichtlichem auch alle GréBen dieser 
Ordnung unterdriickt werden, so erhalten wir die Werte der letzten Kolonne der 
Tabelle II, welche, bis zur GréSenordnug 7 fiir ein beliebiges konstantes Vertei- 


lungsgesetz der Variablen gelten. Sie bestimmen sich, ganz allgemein, nach der 
Formel : 


| 2 
(16). 


-Ist k& so gro8, daB man die Stirling’sche Formel anwenden kann, so erhalt 
man hieraus: 

(16a). 
2(2k + (N-—k—-1) 

Schon bei & = 6 ergeben sich ganz annehmbare Resultate. 

Die Formeln der Tabellen I und II vervollstiindigen nur einander, kénnen 
sich aber gegenseitig nicht ersetzen. Will man feststellen, ob eine fiir die Theorie 
geniigende Stabilitaét der Reihe o;”, o%4,, erreicht ist, berechnet man 
die ersten Differenzen derselben und gebraucht die Formeln der Tabelle II. Ist 
es aber notwendig, die Grenzen der méglichen Abweichung des gefundenen stabilen 
o," von seiner mathematischen Erwartung—sagen wir, mit Hilfe des bekannten 


Tschebyscheff’schen Theorems—zu beurteilen, so wird man sich der GréSen der 
Tabelle I bedienen miissen. 


Ein anderes, in manchen Fillen interessantes, Charakteristikum derselben 
Reihe S bilden die Koeffizienten vom Typus: 


s; 
ke.\2 
2, 


w obei j eine beliebige ganze positive Zahl zwischen O und (NV — k — 1) sein kann. 


E E gleich, wie es dort fiilschlich angenommen wurde, sondern der Grife: 

und zeigt, nach Bestimmung seiner Quadratwurzel, welch einen Teil der Gréfe u, der mittlere Fehler 

der Differenz (2k+1) o’%,,,—(2k+1)c," ausmachen kann. Nur in diesem Sinne darf sie angewandt 

werden. Vergl. dariiber: O. Anderson, ‘Variate Difference Method,” in den Mélanges Pierre Struve, 

Recueil des Ecrits, présentés % M. Pierre Struve le 30. Janvier, 1925, Prague, pp. 26, 27 (russisch). 

* Durch die Formel dieses Maximums erkliirt sich auch, weshalb, bei konkreten Messungen, die 


Reihe der Differenzen ... — verhiiltniBmiBig oft auf dem Diagramm ein 
ganz hyperbelmiiBiges Aussehen erhiilt. 


| 
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Es kann niimlich bewiesen werden, da8 HR’; , desto niher zu 


1 
1 N-k 


herankommt, je gréBer NV ist, um jedenfalls im Grenzfall, bei sehr groBem N, mit 
diesem Ausdruck identisch zu werden*. Nun ist aber 


W-k-j E Sing =(— 1y Be und N (A 8;) = Me 
(18) 
Daher erhalten wir fiir ZR’; , den Grenzwert : 
k+j 
kik! 
ER‘ = (— = (— 19), 


* Der empirische Korrelationskoeffizient und die meisten ihm ihnlichen Gebilde besitzen die 
Eigenschaft, daB ihre Nenner und Zihler versechiedene Funktionen derselben gegebenen Werte Z,, Z2,...Zy 
und 7’, Z2',... Z’y, sind. Die Wahrscheinlichkeit des Auftretens einer bestimmten i-ten Kombination 
von N zusammengehérigen Paaren Z, und und Z,', ... Zy und Z’y, eines, sagen wir, Korrelations- 


koeffizienten sei p;. Und es seien nur n verschiedene Kombinationen dieser Paare, zu N in jeder, 


méglich, so daB 


Bezeichnen wir einen Koeffizienten, entstanden aus der i-ten Kombination der Paare, durch r;= - , 80 
i 


ist es ersichtlich, daf unter den gemachten Annahmen die Zibler x nur n verschiedene Werte: x1, v2, ... %, 
mit den entsprechenden Wahrscheinlichkeiten p; , po, ... p, erhalten kéunen; ist aber x2; gegeben so ist 
nur ein ganz bestimmtes einziges y; méglich, dessen bedingte Wahrscheinlichkeit also gleich 1 zu setzen 
ist, so da, folglich, auch die Zihler y ebenfalls nur n Werte: y;, y2, ... yy, mit denselben Wahrschein- 
lichkeiten p,, po, ... Pn annehmen kénnen. Dieselben Wahrscheinlichkeiten bestehen schlieSlich auch 
fiir die 1; selbst, so daB, also : 


Ex — 
Ey PryitP2y2t 


x. x 
1) Er=E-=p; 


Setzt man — Er=¢;, oder x,=y; Er+y,e;, 80 verwandeln sich obige Ausdriicke in folgende : 
i 


(1) Br. pet 2 + pees (und folglich ne 


n n n 

Er. = € yipie 

Ey ky y Ky 


Wir sehen also, dab, damit = notwendigerweise auch > gleich Null sein wenn 
i=1 


man, natiirlich, vom Falle Ey =0 absieht. 
Wenn die Grifen y,, yo, ... yx mit wachsendem N demselben Grenzwert (sagen wir, Ey) zustreben, 
so kénnen hier die einzelnen y schlieSlich als einander gleich angesehen und vor das Summenzeichen 


genommen werden. Dann erhiilt man auch vermége der Beziehung 2 pieg=O: 
i=1 


Mit wachsendem N kommen also dann die Ausdriicke Ey und = sich immer niher, um im Grenz- 


falle einander gleich zu werden. 


2 

Bes. 

: 

| 
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und folglich : 


k(k—1) — 1) (k —2) 
(19a). 


Es ist méglich, zu beweisent, wenn den eben angegebenen Wert 
besitzt, auch unbedingt wenn, ferner, ER’,,, ER’sx,... 
die in Form. (19) angegebenen Werte annehmen, so ist auch, ganz allgemein, 


Eo = Ko =...=Eoe 


Folglich, kénnen die Koeffizienten R’;,,, wenn ein entsprechendes Kriterien- 
system aufgebaut ist, die schwierige Berechnung der héheren endlichen Differenzen 
in einigen Fallen ersetzen und dadurch die notwendige Rechenarbeit erleichternt. 


(8) AuBer der statistischen Reihe u,, we,... wy, sei noch eine andere Reihe: 
V1, Uy, gegeben, Sie bestehe ebenfalls aus einer “glatten” Komponente 
und einer “ restlichen” s’, so daB v; = G/ + s;.. Es sei festgestellt, daB in der k-ten 
endlichen Differenz keine Reste von G’ mehr verblieben seien, so dab A*v; = A*s;’. 
AuBerdem sei angenommen, daB auch die Reihe s’ als eine solche Reihe angesehen 
werden kénne, welche eine zufillige Variable mit einem konstanten Verteilungs- 
gesetze bei V von einander unabhiingigen Versuchen ergibt. 


Fiihrt man, in Anlehnung an Prof. Tchouproff §, die Bezeichnungen ein : 


Mo, = Es! (und folglich : = Es%, mo, = 


(9) 
mon = Ess"; 


= (8; myo)? (8; — mon)’ (und folglich: (20), 
= E (s mo)". = E (s' = mon") 


= (si — mio) — Mon)" 


N-k-j 


* Um = Ats;A*s;,; zu bestimmen, ist es hauptsiichlich notwendig, den Satz : 


1 
N-k i=1 
i=0 
Theorie der Kombinatorik etwas neues zu sagen) und findet sich, zum Beispiel, in Netto’s Lehrbuch 


k 2k 
der Kombinatorik. Am leichtesten ist er aus der Betrachtung der Identitiit (1+ t)* (147) (1+?) 


zu beweisen. Dieser Satz ist nicht neu (es diirfte iiberhaupt nicht leicht sein, in der 


k-j 
zu beweisen. Entwickelt man niimlich die linke Seite, so ist: Ch der Koeffizient vor ent- 
i=0 


wickelt man die rechte Seite, so steht vor t/ der Koeffizient: — . 
+ Vergl. O. Anderson, Uber ein neues Verfahren, etc., 8. 138—140. 
{ Niiheres dariiber siehe in der eben zitierten Schrift. Die dort benutzten Kriterien = 7% und o-1 
bestimmen sich aus #1 | 


g Al. A. Tchouproff, “‘ Aufgaben und Voraussetzungen der Korrelationsrechnung” (Nordisk Statistisk 
Tidskrift, 1923, Bd. 2, Haft 1), 8. 52, Anhang. 


| 
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ferner die Bezeichnungen : 


N-k 
> | 
(21), 
N-k-j 
1 Aks; A i+j 
4=] 
so kann es leicht bewiesen werden, daB: 
Ep, = 


wenn nur s; und s’;,;, bei jedem j + 0, vollkommen unabhingig von einander sind und 
folglich = 0 bei j +0. 


Unter denselben Bedingungen wird auch : 


ky E - — fin)’ = er (hy 
2k 
+ Ga k 


Der Aufbau dieser Formel ist genau derselbe, wie bei (12). 


Nur mu8 man 
Min + Morr 
2 


Statt me’, und (p22 — — statt setzen. 


Letzterer Ausdruck verschwindet ebenfalls, wie (4,— 3y,%), im Falle einer “nor- 
malen” Korrelationsfliche, wo die bekannte Relation ry. = 1+ 2r%, besteht*. 


Mit den soeben erwahnten Substitutionen kénnen also auch hier die Formeln der 
Tabelle I benutzt werden. 


Mit denselben Substitutionen sind ebenfalls die Formeln der Tabelle II auf 


die mathem. Erwartungen der Quadrate der Differenzen: — anwendbar. 


Desgleichen erhalten wir ferner : 


Ep; = Ep’ = Ep’ 


wenn: 
(-)) (+1) (-2) (+2) (+3) 
kik! 
23). 
*S. unten: Formel (24). Bei Anwendung letzterer Formel kann man (22) auch folgendermafen 
darstellen : 


+ O. Anderson, Uber ein neues Ve rfahren, etc., 8. 138. 


tay. 
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Definiert man: 


= und folglich =—————_) ........< 


2/0 
= int 1/0) (8 i45 — Mon 
WV 2 — my). (s; — mon)? 
= 
1 ' , 
V-j (8: — — 8m) 
N 
8; 
wenn sy) = *, und ferner : 
Pj (k) 


(wobei ,o,? bei Anwendung der Formel (11) auf die Reihe s,, s., ..., und .o,? bei 


Anwendung derselben Formel auf die Reihe s/’, s,.’,... entsteht),—so kann man 
leicht beweisen, daB 
Ep, (k) = = 


wenn man nur davon absieht, dab Ep, y blo& in erster Annaherung dem Ausdruck 
Ep; 
gleich gesetzt werden kannt. 


Was die GroBe 0°), 4) = E (pom — rin)? anbetrifft, so ergibt sie sich—in erster 
Anniherung, fiir eine “normale” Verteilung und mit allen Reserven, die durch 
den neuesten Stand der Forschung geboten sind,—aus der Formel : 


2 2k 
2 (l—riy) | ak k | 
Auch hier kénnen, also, die Formeln der letzten Kolonne der Tabelle I benutzt 


(l—ri,) 
werden, wenn man dort durch 


ersetzt f. 


* Wenn j negativ ist, so mu, statt WS , tiberall WG genommen werden. 


+ Vergleich hieriiber besonders die neuesten (russischen) Arbeiten Prof. A. Tchouproff’s : Uber diz 
mathematische Erwartung des Quotienten zweier gegenseitig nicht unabhiingigen zufilligen Variablen, 
und Das Ausgangsproblem der mathematischen Theorie der verschiedenen Verfahren fiir die statistische 
Untersuchung der Zusammenhiinge zwischen zwei zufilligen Variablen, Ferner E. Slutsky, Uber einige 
Korrelationsschemen und iiber den systematischen Fehler des empirischen Korrelationskoeffizienten. 

+ Werden demzufolge die vor N stehenden Koeflizienten dieser Kolonne durch 2 dividiert, so erhiilt 
man ganz richtig die von A. Ritchie-Scott im x1. Bande der Biometrika (‘‘ Note on the probable error of 
the Coefficient of Correlation in the Var. Diff. Corr. Method”) berechneten Funktionen : 2¢(m)-1. 
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Ist schlieBlich NV und k geniigend gro8, so kommt man auch zur Formel : 


Die Annahme der gegenseitigen vollkommenen Unabhingigkeit aller s; und s;, 
ausgenommen den Fall i=j, kann aber lange nicht immer gerechtfertigt werden. 
Es ist sehr wohl méglich, daB, obgleich jede der beiden zufilligen Variablen ein 
konstantes Verteilungsgesetz aufweist und bei jeder die einzelnen Versuche von 
einander vollkommen unabhiingig sind, es dennoch zwischen verschiedenen s; und 
8:4; 2ine gewisse Korrelation bestehen kénne. So vermag man, zum Beispiel, aus 
der Reihe s,, s., s,, folgende zwei Reihen zu bilden: 


+& 8, +85 +8, 
= 84 +85 +8, Yo = 8, +83 + 
= 8; + 5g +S, und Ys = 8; +8 + 
89 + $y + Ys = 819 + S14 + Si, 


Wenn die s von einander unabhingig sind, so sind auch alle # von einander 
unabhingig, und ebenso alle y. Und trotzdem hat jedes y; je eine gemeinsame 
Komponente nicht nur mit jedem #;, sondern auch mit a, und a+». 


In solchen Fallen (und sie brauchen nicht selten vorzukommen) wird also 
weder die Reihe der Koeffizienten Ep, Epo 


... konstant, noch wird einer 
derselben 71), ergeben. 


Um hier die Differenzenmethode anzuwenden, mu8 man entweder zu E. S. 
Pearson’s Funktionen $(n, po, p)* oder, was auf dasselbe hinauslauft, zu den von 
mir im Artikel “Uber ein neues Verfahren,” ete.t, entwickelten Formeln greifen. 
Der Gebrauch letzterer wird weiter unten an einigen konkreten Beispielen illus- 
triert. Hier seien sie nur des Zusammenhanges wegen angefiihrt. 


Bei gréferen NV und unter den Voraussetzungen des § 10 (siehe dort) kann 
mit ganz guter Anniherung gesetzt werden: 


2k 
= (Cy, — 20 Ry + — Re} (29), 
2k 
wenn R; = (29a), 


i=1 
1 (8: sun} 
und ,o’ und ,o’ die auf die Reihen s und s’ angewandte Formel (10) bedeuten. 
* K. Pearson and E. M. Elderton, ‘‘On the Variate-Difference Method” (Biometrika, Vol. xiv. 


March 1923, p. 294), Form. (xviii) und ff. 
+ Loc. cit., pp. 136, 137. 


2 — 
* 
A 
P 
Sigg 
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(9) Gehen wir nun weiter. Angenommen, die empirischen Formeln fiir 

Pes P Powes Poet)» ete. 
entsprechen “m. B.,” das hei8t : modo Bernoulliano*, dem Schema einer zufalligen 
Variablen mit einem konstanten Verteilungsgesetz und N von einander un- 
abhingigen Versuchen. Ist dadurch schon bewiesen, daB die urspriingliche Reihe 
Us, +.. Uy eine solche Komponente wirklich besitzt? Ist es itiberhaupt sicher, 
daB die gefundenen Werte, wenn auch ebenfalls nur “modo Bernoulliano,” die 
gesuchten sind? Und wenn nicht, so was haben wir denn eigentlich erreicht ? 


Die erste und zweite Frage sind entschieden zu verneinen. Es wiire naiv zu 
glauben, daB die Differenzenmethode oder eine beliebige andere Methode imstande 
waren, das Kunststiick zu vollbringen,—ohne irgendwelche apriorische Kenntnisse 
iiber die zu zerlegende Reihe zu besitzen (iiber solche Kenntnisse verfiigt man 
in der Praxis nur in den seltensten Fallen) und ohne mehr oder weniger gewagte 
Hypothesen aufzustellen—aus NV gegebenen Gleichungen vom Typus u;= G; + si 
ganze 2N Unbekannte (“G” und “s”) zu bestimment. Ohne derartige Hypo- 
thesen kann man eben beim Zerlegen statistischer Reihen nicht auskommen. 
Aber gerade in dieser Beziehung erméglicht die Anwendung der Differenzen- 
methode, einen Schritt weiter zu tun, als es sonst in der Regel iiblich ist, denn bei 


ihr wird das evident gehalten, was bei anderen Methoden nur im stillen und 
unbewuBt hineingeschmuggelt wird. 


(1) Eine jede gegebene statistische Reihe von N Ghedern kann durch eine 
Parabel, nicht héher als (V — 1) Ordnung genau wiedergegeben werden. (2) Eine 
jede gegebene statistische Reihe kann durch eine Fourier-Serie oder durch eine 
andere trigonometrische Funktion genau dargestellt werden. Je héher die Ordnung 
der Parabel, je griBer die Zahl der Parameter in einer trigonometrischen Funktion, 
desto genauer wird sich, im Allgemeinen, die berechnete “ausgeglichene” Kurve 
den Gliedern einer gegebenen Reihe anpassen und desto geringere Werte werden 
die scheinbaren “restlichen Komponenten” aufweisen, bis sie, endlich, ganz ver- 
schwinden. Wo hat hier der Forscher Halt zu machen? Wo, bei welcher Potenz 
der Parabel, bei welcher Parameter-Zahl der trigonometrischen Formel hat er 
Grund festzustellen: dies hier ist ungefiihr die “glatte” Komponente und das 
dort diirfte nur eine Funktion der wahren “ restlichen Komponente ” ausmachen ? 
So naheliegend eigentlich diese Frage ist, kann man doch nicht behaupten, dab 

* Ein Ausdruck Prof. W. Romanowsky’s, der in der russischen statistischen Litteratur sich ein- 
zubiirgern scheint. 


+ Wie es die neueren Untersuchungen Prof, Al. A. Tchouproff’s und eines seiner Schiiler (J. Morduch) 
erwiesen haben, kann man, iiberhaupt—sogar aus dem Studium einer einfachen (nicht zusammenge- 
setzten) statistischen Reihe heraus—lange nicht alle Higenschaften der ihr zu Grunde liegenden ziifalli- 
gen Variablen absolut sicher bestimmen. So, zum Beispiel, ist es direkt unméglich, den Fall der 
gegenseitigen Unabhiingigkeit der an einer zufilligen Variablen angestellten Versuche von einigen 
anderen Fiillen, etwa vom Falle einer “uniformen Verbundenheit” der Versuche, zu unterscheiden. 
(Vergl. Al. A. Tchouproff, “ Ist die normale Stabilitit empirisch nachweisbar?” in Nordisk Statistisk 
Tidskrift, Band 1, 1922, Hiif. 3, 4.) Es liegt hier sogar kein bescheidenes “ignoramus,” sondern 
geradezu ein anmafendes “‘ignorabimus”’ vor. 
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sie bis jetzt eine eindeutige Antwort erhalten hat. Viele Forscher machen eben 
gerade dort halt, wo es in ihre Theorien am besten pat, oder wo ihnen die 
Rechenarbeit beschwerlich zu werden beginnt. Und da die menschliche Geduld 
eine stark verinderliche Gréfe ist, so bleibt der eine schon bei einer Geraden 
stehen und geht der andere bis in die hohen Parabelordnungen hinein. 


Die Differenzenmethode erlaubt hier eine objektivere Stellung einzunehmen. 
Sich ihrer bedienend, kann der Forscher etwa folgendes feststellen. Von einer 
gewissen k-ten Differenz angefangen, sind bestimmte Koeftizienten fiir eine oder 
fiir mehrere statistische Reihen ungefihr stabil geworden. Das kdnnte bedeuten, 
daB die “ glatte” Komponente angenahert durch eine Parabel (k — 1)-ter Ordnung 
darstellbar sei, und das kdnnte ferner bedeuten, die “ restliche Komponente 
als eine zufallige Variable mit konstantem Fehlergesetz und gegenseitiger Unab- 
haingigkeit der einzelnen Versuche angesehen werden kénne. Und je linger die 
Reihe der stabilen Koeffizienten (“modo Bernoulliano,’ natiirlich),—desto wahr- 
scheinlicher der SchluB. Oder: die Reihe der Koeffizienten wird nicht stabil; es 
ist also unwahrscheinlich, die “restliche” Komponente eine derartige zufallige 
Variable sei; wahrscheinlicher sei, daB sie diese oder jene anderen Eigenschaften 
besitze. Oder: die mittleren Fehler sind zu groB und das Material laBt ver- 
schiedene Deutungen zu; es ist, folglich, fiir eine Zerlegung in einzelne Kom- 
ponenten iiberhaupt wenig geeignet, u.s.w.* 

Haben solche Feststellungen theoretischen Wert? In manchen Fiillen doch 
wohl! Kénnen sie dadurch widerlegt werden, da8 man etwa beweist, da8 kurz- 
periodische Sinus-Komponenten unerklairbaren Ursprunges die zufallige Kom- 
ponente in gewissen Fiillen darstellen oder verdecken kénnen? Ich glaube nicht. 
Denn, erstens, ist die Annahme der Existenz einer zufalligen Beobachtungs- 
komponente mit gegenseitiger Unabhiangigkeit der Glieder doch schlieBlich die- 
jenige Hypothese, auf welcher bis jetzt, trotz aller Kritik, der gréBte Teil der 
Fehlertheorie aufgebaut ist; und, zweitens, ist die vorgeschlagene Hypothese 
jedenfalls nicht besser begriindet. Kein Forscher verwirft aber eine alte Hypothese, 
die einen gegebenen Erscheinungskomplex befriedigend erklart, nur deshalb, weil 
derselbe auf eine andere, nicht weniger kiinstliche, Weise auch erklirt werden 
kénntet. 


* Die Frage, ob das gegebene Material, bei Annahme einer bestimmten Formel fiir die “‘ glatte” Kom- 
ponente, noch die Hypothese der Anwesenheit einer ‘“ zufiilligen” restlichen Komponente iiberhaupt 
zulaBt, wurde bis jetzt meistenteils doch wohl nur von denjenigen Forschern untersucht, welche bei 
Sterblichkeitstafel-Ausgleichungen das Lexis’sche Kriterium Q? benutzten. So, zum Beispiel, in RuB- 
land von Herrn B. Jastremsky (seine Methode zur ‘‘ Auffindung des sich veriindernden Niveau’s einer 
statistischen Reihe” liuft letzten Endes auch auf die Berechnung von Q? hinaus). Es fragt sich nur 
ob der Lexis’sche Divergenzkoeffizient fiir einen solechen Gebrauch wirklich geeignet ist (vergl. unten 
§ 12). 

+ Und vollkommends geht es nicht an, nur auf Grund der theoretischen Méglichkeit eines stéren- 
den Einflusses von, sagen wir, Yules kurzperiodischen Sinus-Komponenten, behaupten zu wollen, daBh 
diese auch wirklich stéren. Ihre Existenz miiBte zu allererst nachgewiesen werden, was bis jetzt 
einwandfrei noch nicht gelungen ist. Ein drastischer Vergleich. Wenn ein Meteorstein auf eine Loko- 
motive herabfillt, so diirfte es voraussichtlich ein Eisenbahnungliick geben. Folgt aber daraus, dab 
man in der Eisenbahn nicht fahren soll, weil Meteorsteine auf Lokomotiven fallen ? 


: 
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(10) Zur Kontrolle, untersuchen wir jetzt die Frage, was man mit Sicherheit aus 
einem bestimmten Verhalten der Gréfen o'%,4, u.s.w. iiber die Beschaffenheit 
der restlichen Komponente folgern kann. Wie ein jedes “ umgekehrtes” Problem, 
ist diese Frage schwieriger zu behandeln, als die “ direkte.” 


Das denkbar allgemeinste stochastische Schema zur Darstellung der restlichen 
Komponente diirfte wie folgt lauten: “An N zufalligen Variablen—S,, S,, ... Sy— 
werden N Versuche vorgenommen—je ein Versuch an jeder derselben. Die 
Variablen migen beliebigen Verteilungsgesetzen folgen und unter einander in belie- 
bigen Abhiingigkeitsbeziehungen stehen*.” Wie es aber Professor A. Tchouproff 
selbst gezeigt hat+, kommt aus einem derartigen Schema nichts rechtes heraus, 
da, ohne bestimmte Annahmen iiber die Verteilungsgesetze und die gegenseitige 
Verbundenheit der Versuche, die Formeln immer betriichtlich mehr Unbekannte 
aufweisen werden, als es verschiedene Gleichungen aufgestellt werden kénnen. 
Fiihren wir deshalb noch 2 beschrinkende Bedingungen ein. Erstens, miégen die 
mathematischen Erwartungen beliebiger Potenzen aller Variablen konstant sein : 


Es,* = =... Esy*, bei h=1, 2, 3,... 


und zweitens, mége die Differenz 


Es; Es; Es;,; = ff (s; Es) (8:4, Es), 
von ¢ unabhingig und bloB von j abhiingig seint. 


* Vergl. Al. A. Tchouproff, ‘‘ Zur Theorie der Stabilitit statistischer Reihen,” Skandinavisk Aktuarie- 
tidskrift, 1919, S. 203. 

+ Al. A. Tchouproff, ‘On the mathematical expectation of the moments of frequency distributions 
in the case of correlated observations,” Metron, Vol. 1. N. 3—4, 1923, pp. 7, 8. 

+ Vergl. Tchouproff, ‘‘Zur Theorie der Stabilitit, etc.,” S. 209. O. Anderson, “Uber ein neues 
Verfahren, etc.,”S. 137. (Die beschriinkenden Bedingungen sind in der letzten Schrift etwas allgemeiner 
gehalten, als oben. Der Gewinn ist aber nicht besonders groB und verdunkelt nur die Formeln.) Czuber 
regt ihnliches an. Auch bei A. A. Markoff finden sich iihnliche Gedankengiinge. Dadurch, daB nicht 
alle Es,s;,;— Es, s;,; bei beliebigen j einander gleich gesetzt werden (was zu einer ‘‘ uniform-verbundenen 
Reihe ”’ fiihrt), sondern daB ‘‘Ketten” gebildet werden : 


Es, Es, E8983 Ess = E8384 Es, Es, =... Esy_\8y- 
Es, 83 — Es; Es3= Eso84 — Eso Es4= E8385 -- Es3 Es5=...= Esy_98y— Esy_g Esy, 
Es) 814; Es, — E89,;=...= E8y_j8y — Esy_jEsy, 


Es\8y_ — Es, Esy_\ = Esosy— Esq Esy, 


gewinnt man einen héheren Grad der Allgemeinheit und nihert sich den in der Praxis zu beobachtenden 
Verhiiltnissen. Man denke, zum Beispie], an Messungen, die an einer Reihe von Generationen unter- 
nommen werden: Vater—Sohn—Enkel—Urenkel, u.s.w.; andererseits—an Temperaturkurven eines 
Malaria-Kranken, an Auftretungsdichtigkeiten gewisser Insekten mit einer mehrjihrigen Verwandlungs- 
dauer, wie, zum Beispiel, Maikiifer (Melolontha vulgaris), u.s.w. 

In der Theorie der Stabilitit statistischer Reihen werden gewohnlich ganz andere beschriinkende 
Bedingungen eingefiihrt. -Man nimmt, niimlich, in der Regel an, da an jeder der Variablen S je k 
Versuche vorgenommen werden, wobei die Reihenfolge der Variablen durch das Los bestimmt wird. 
Auf eine in der Zeit sich veriindernde statistische Reihe angewandt, bedeutet das die Annahme einer 
serienweisen Veriinderung der Grundwahrscheinlichkeiten, also von Beobachtungsreihe zu Beobach- 
tungszeihe, so daB wiihrend einer solchen eine konstante Wahrscheinlichkeit herrscht (vergl. Czuber, 
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Bezeichnet man noch 


£ (8; — Es) (8:4; — Es) 
E (s;— Es) 


und denkt sich diese Koeffizienten in der Reihenfolge 


, (folglich, ist r,=1) ....... 


geordnet, so erhalt man leicht die beiden apriorischen Formeln, welche bei den 
gegebenen Annahmen genau sind, den empirischen Anniherungs-Formeln (28) 
und (29) zu Grunde liegen und, schlieBlich, diejenigen Bedingungsgleichungen 
ergeben, welchen die Koeffizienten 7; geniigen miissen, damit genau 


Eo;? = Eo", 


N-k 
= 1) A*r,,; und = p,(— r, ...* 


Die genannten Bedingungsgleichungen kénnen in zwei verschiedenen Systemen 
dargestellt werden, die auf einander unschwer zuriickzufiihren sind : 
bei i= 1, —7,+7,=0 ) 
» t=2, —5r,+8r,—37r,=0 
» t=3, —7r, + 14r,— 9r, + 2r7,=0 
» t=4, — 42r, + 96r, — 81r, + 32r,—5r,=0 


bei — V’r, =0 
» =0 
» t=4, —10V'r,— 150" — 1207, 5V" =0 ...(33a). 


Das umgekehrte Symbol V wurde deshalb eingefiihrt, weil die Reihe (31) 
hier als in umgekehrter Richtung (von rechts nach links) differenziert gedacht 


loc. cit., 11. 8. 42); demgegeniiber setzt das von uns hier angewandte Schema eben eine stetige und 
sozusagen gesetzmiiBige Veriinderlichkeit der Grundwahrscheinlichkeiten voraus. 

Es lift sich natiirlich dariiber streiten, welches der beiden stochastischen Schemen allgemeiner und 
weniger gezwungen ist. Wir, wenigstens, glauben, daf sie als ziemlich ebenbiirtig einander gegeniiber- 
gestellt werden kénnen. Jedenfalls fiihrt unser Schema, auf den Lexis’schen Divergenzkoeffizienten 
angewandt, zu einigen ganz interessanten und vielleicht auch ziemlich unerwarteten Resultaten. 
Vergl. hieriiber unten § (12). 

* 0, Anderson, Uber ein neues Verfahren, etc., 8. 142 und 143, 
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wird ; es ist also V’r, gleich 7, —7, und nicht r;—7, gleich r,—2r.+7, und 
nicht 7,—2r,+74, V’’r, gleich r, — 3r, + 37; —7, und nicht r, — 37, + 38r_, — r_s, 
u.s.w. Diese Bezeichnung ist mit der von den Aktuarien gewéhnlich gebrauchten 
nicht zu verwechseln. Es ist noch zu beachten, daB die unterste Formel in (33) 
dem Falle i =k —1 und in (33a) dem Falle i= entspricht. 


Ist schon Eo, = Eo,” = =...= so entspricht das dem Falle i=1, 
und alle Bedingungsgleichungen (33) und (33a) miissen erfiillt sein. Ist erst, 
zum Beispiel, Ho,” = Eo,? =... = Eo;?, so entspricht das dem Falle i=3 und alle 


Gleichungen des Systems (33) miissen erfiillt sein, ausgenommen die beiden ersten, 
wo die linken Seiten nicht Null zu sein brauchen. Ebenso im System (33a). 
Andererseits, sind die Bedingungsgleichungen von einem gewissen 7 abwiirts 
erfiillt, so miissen auch die Beziehungen bestehen: = 07% 4.=.... Reihen, 
fiir welche solche Beziehungen gelten, bezeichne ich als zur “Gruppe R” gehirig. 


Der Inhalt der’ Bedingungsgleichungen (33) und (33a) ist nicht so leicht zu 
erschépfen, und wir werden uns hier nur mit dem Nétigsten befassen. 


Ist ¢ gleich 1, so verfiigen wir iiber /—1 verschiedene Gleichungen zur 
Bestimmung von k Unbekannten: 7,, 7, ... 7. Wir erhalten aus ihnen leicht : 


das heift, die Bedingungen einer “uniformen Reihe” Tchouproff’s. 


Ist 7 gleich 2, so besitzen wir nur (k — 2) Gleichungen fiir / Unbekannte und 
kommen zum System : 


7 9 
11 2k —1 


Die Reihe 7, 73, ... wird, folglich, durch gewisse Parabeln 2-ter Ordnung 
genau ausgedriickt. Wie man sieht, werden dabei die Differenzen 1; — 1;4,, welche 
durchwegs dasselbe Vorzeichen behalten, mit wachsendem 7 immer griéfer. Ein 
derartiges Verhalten der Reihe 7 entspricht so wenig demjenigen, welches wir 
von der Wirklichkeit erwarten kénnen, da8 es wohl bei weitem wahrscheinlicher 
bleibt, da8B 7,-—7,=0. Wir kehren, folglich, auch hier zum System der “uniformen” 
Verbundenheit der Versuche zuriick. 


Ist, iiberhaupt, 7 gleich j, so verfiigt man iiber nicht mehr als / —j Gleichungen 
zur Bestimmung von k Unbekannten. Es kénnen, folglich, nur je (—j) Un- 
bekannte durch gewisse lineare Funktionen der iibrigen 7 ausgedriickt werden. 
Es ist aber méglich, ganz allgemein zu beweisen, daB auch in diesem Falle die 
Reihe (31) durch Parabeln jedenfalls nicht héher, als (2j)—1)-ter Ordnung dar- 
stellbar sein miiBte*. Je gréfer die letzte erreichte Differenz k im Vergleich 
zu (2) — 1) ist, desto weniger wahrscheinlich wird ein so gesetzmifiges Verhalten 


* Ebenda, S. 146 Anmerkung. 
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der Reihe der r, und desto wahrscheinlicher, also, die friihere Annahme einer 
“uniformen” Reihe*. 


Wenden wir uns jetzt den Bedingungen zu, unter welchen die Reihe Ho,?, Eo,”, 
Ka;?, ... nicht konstant werden kann. 


"Js kann bewiesen werden+, daB wenn, von einem gewissen 7 angefangen, in 
System (33) und (33a) die linken Seiten aller Gleichungen >0 werden, auch 
< < ... sein miissen. Reihen, fiir welche solche Beziehungen 
bestehen, gehéren zur “ Z-Gruppe.” 


Und umgekehrt, wenn diese linken Seiten in (33) und (33a), von 7 angefangen, 
alle < 0 sind, so haben wir, im Gegenteil, Ha;* > Eo”;,, > Eo";,.>... Reihen mit 
solchen Beziehungen zwischen ihren Streuungen gehéren zur “ G-Gruppe.” 


Betrachten wir anfangs den zweiten Fall und begniigen wir uns mit der Unter- 
suchung nur derjenigen Reihen, fiir welche von Anfang an 


> Eo," > Eo,*> EHo;?>... 


(Im Verhaltnis zu den Streuungen der “ R-Gruppe” werden also die Streuungen 
dieser Gruppe mit jeder Differenz immer geringer.) Dann kénnen wir zum 
System (33a) noch die fehlende Ungleichung — 7,< 0 hinzufiigen und erhalten 
dann: 7, >0;7,-—7, >0; 7, > 3 (7, —2r24+7;), us.w. Die Ungleichungen werden 
mit wachsenden ¢ immer komplizierter und weniger durchsichtig. Um zu ein- 
facheren Formeln zu gelangen, wollen wir jetzt noch eine weitere begrenzende 
Bedingung einfiihren: die linken Seiten des Systems (33a) werden namlich 
a fortiori < 0 sein, wenn alle V“ 7, > 0 (bei? = 1, 2, 3,... &). Und dann kommen 
wir zum System : 

>0; n>; Vri>V're; > Vrs, ... > VF 15. 
Dieses System ist viel handlicher und kann durch verschiedene “ Modelle ” darge- 
stellt werden. Am einfachsten wiirde wohl die Annahme sein: r,=p, 7r,=p*, 
r,=p*,... In diesem Falle ergiebt es sich niimlich, daB V'r;=p(1— py, 
und dieser Ausdruck bleibt immer positiv, solange 1 > p > 0. 


* Besiifen wir Kenntnis iiber den genauen Wert von Es, so kénnten wir leicht, falls schon 


N N-1 
E = (8;- Hs)? E & (d's)? 
, die fehlende Gleichung 7,;=0 erhalten und das System (33) eindevtig 


auflisen. Uber eine solche Kenntnis verfiigen wir aber nur in den‘seltensten Fillen, Aus der Gleichung 
N N-1 
E = = (A’a,)? 
Nol 2(N-1)”’ WO §m=H PAS erhalten wir die fehlende Gleichung nicht, da in ihr, 


dank der Einfiihrung von s,y, alle 7; von 7; bis ry_; auftreten werden, Und wenn man sogar bis zur 
(N - 1)-ten Differenz vorgeht, so erweist es sich, daB diese Gleichung schon im vollen System (33) ent- 
halten ist, also nichts neues bietet. 

Direkt ist kein einziges r; zu bestimmen, da es keine einzige empirische Formel gibt und geben 


kann, deren mathematische Erwartung genau 7; ausmachen wiirde. Vergl. hieriiber noch unten, eine 
Anmerkung in § (12). 


+ Ebenda, S. 142—143. 
Siehe Form. (9). 
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Andererseits, wiirden wir auch in dem Falle zu einfachen Formeln gelangen, 
wenn wir wieder annehmen, daB die Reihe 7 durch eine gewisse Parabel i-ter 
Ordnung dargestellt werden kénne. Dann wiirden alle endlichen Differenzen von 
der Ordnung (i+ 1) aufwiirts gleich 0 sein. Verschwinden, zum Beispiel, schon 
die zweiten Differenzen, so erhalten wir folgendes leicht fabliches System : 


Verschwinden erst die 3-tten Differenzen, so kommen wir zum System : 
Te +1, + 


Solch ein Verhalten wiirde fiir eine “zufiillige” Variable, als welche wir die 
“restliche Komponente” ansehen, wenig wahrscheinlich sein, kénnte aber sehr 
wohl ein Gegenstiick fiir die Hypothese einer parabolischen Beschaffenheit der 
“glatten Komponente” abgeben. 


SchlieBlich sei noch bemerkt, daB auch Yule’s Sinus-Reihen mit der Formel 
u; =A sin (2 im Zwischenraum und TZh28T ebenfalls 
zur “ G-Gruppe” gehéren. 


Was nun den 3-tten Fall anbetrifft, das hei8t, die Bedingungen fiir eine solche 
“restliche Komponente,” welche beim endlichen Differenzieren rascher wachst, als 
es eine Reihe mit gegenseitig unabhangigen oder uniform verbundenen Versuchen 
zulaBt (“ Z-Gruppe’’), so kénnen, wie gesagt, zu ihrer Untersuchung ebenfalls die 
Formeln des Systems (33a) zugezogen werden, indem hier vor 0 iiberall das 
Zeichen > gesetzt wird. Und ebenfalls a fortiori kann man auch folgende Bedin- 
gungen aufstellen : 


7,<0, V'n<V're, < ... < 


Diesem System wiirde, zum Beispiel, eine jede Reihe s geniigen, bei welcher alle r; 
bei ungeradem i negativ und bei geradem i positiv sein wiirden. Zum Beispiel : 

(1) n=p, ... n=p', wenn 0>pz—1; oder: 

(2) wenn a>| Aj und|A\|+a<1; 
oder: 


(3) r;=A+(—1)' (a+ th), wobei jedoch | kh |<a>0 sein wenn h negativ 
ist, und auBerdem | A |+ a+) kh|< 1, us.w. 


In diese Klasse gehéren auch Yule’s kurzperiodische Sinus-Reihen. 


(11) Wenn wir nun von diesen wahrscheinlichkeitstheoretischen Schemata 
zur statistischen Wirklichkeit zuriickkehren, wo, in der Regel, weder 7; noch 
Eo;? gegeben sind, so ist es am vorteilhaftesten, von der Betrachtung der Diffe- 
renzen 0,2, On? —O%4, auszugehen. Ké6nnen wir an 
Hand der Formeln der Tabelle II feststellen, da® diese Differenzen relativ 
noch so klein sind, da®B man modo Bernoulliano das Bestehen der Gleichungen 
Eo? = Eo”;,, =... annehmen darf, so kann man, auf Grund des im vorherge- 
henden Paragraphen gesagten,—mit einer Wahrscheinlichkeit, die desto gréBer 
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wird, je weiter der Zwischenraum zwischen i und k,—annehmen, daB die i-ten endl. 
Differenzen der Reihe w,, we, ... wy nur lineare Funktionen einer zur “ R-Gruppe” 
gehérigen Komponente s enthalten. Man kann dabei mit einer ziemlichen Sicher- 
heit, wenigstens fiir gréBere N, erwarten, daB die Glieder derselben entweder 
unabhiingig von einander oder, im schlimmsten Fall, miteinander nur uniform 
verbunden sind. Bleiben die Differenzen der Streuungen bis o,?—o%,, im 
Verhiltniss zu ihren mittleren Fehlern groB und dabei negativ, so kann man noch 
mit héheren Differenzen sein Gliick versuchen. Sind sie, im Gegenteil, groB und 
positiv, so bedeutet das die wahrscheinliche Anwesenheit einer Komponente von 


der “ Z-Gruppe.” 


Es muB hierbei noch folgendes bemerkt werden. Gehért die Reihe s zur 
“ R-Gruppe,” so besitzt die Reihe A’s,, A’s,,... schon die Eigenschaften einer 
“ Z-Gruppe,” und je héher k, die Ordnung der Ditferenz, desto mehr werden diese 
“ Z”-Eigenschaften hervortreten. Besteht nun jedes Glied der Reihe: s,, 83, ... 
selbst aus einer Summe von 2 Komponenten: einer “R”-artigen und einer 
“Z”-artigen, so kann die “schidliche” Wirkung der letzteren in den meisten Fallen 
sich nur in den wenigen ersten Differenzen bemerkbar machen. Das Maximum 
der “ Schiidlichkeit ” erreicht eine Komponente der “ Z-Gruppe” bei 


1=-1, 


Nun ist fiir eine solche Reihe 


wahrend eine von der “ R-Gruppe” blob Boil #, ergeben wiirde, oder, angenahert, 


k 
Hz (vergl. Form. 11a), Fiir k= 6 wiirde das eine 4499 x fache VergréSerung 
pa 


des “relativen Einflusses” der mazimalen “Z”-Komponente bedeuten. Eine 
weitere Verdoppelung desselben tritt aber nur bei k= 24 ein. Ist daher, zum 
Beispiel, an der Differenz o,?— ¢;? die Wahrscheinlichkeit des Vorhandenseins 
einer “ Z-Gruppen”-Komponente nicht festzustellen, so ist es auch verhiiltnis- 
mafig wenig wahrscheinlich, die weiteren Differenzen eine solche Kompo- 
nente zum Vorschein treten lassen kénnten (wenn, natiirlich, in der 6-ten Differenz 
die Komponente “G@” schon eliminiert ist). Diese “Z”-Komponente existiert, 
offenbar, tiberhaupt nicht oder ist im Verhiiltnis zur “R”-Komponente nicht 
bedeutend. 


Es entsteht nun die Frage: sind die méglichen Ergebnisse einer Untersuchung 
der Streuungen der Differenzen einer gegebenen Reihe auch geniigend wichtig, 
um die hierbei erforderliche Rechenarbeit zu rechtfertigen? hat die Unter- 
scheidung der Gruppen “Z,” “R” und “G@” irgendeinen Erkenntniswert auch 
auBerhalb der Anwendung der Differenzenmethode ? 


(Hin Schlufartikel folgt.) 


: 
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ON THE MEANS AND SQUARED STANDARD-DEVIATIONS 
OF SMALL SAMPLES FROM ANY POPULATION. 


By A. E. R. CHURCH, M.A., B.Sc. 


Part I. IntrrRopuctory. 


THE theory of sampling, both as to mean and standard-deviation, has been 
known with completeness for some considerable time, provided the sampled 
population follows the normal law. That the distribution of means follows the 
normal law whatever the size of the sample, provided the sampling is from normal 
material, has been known from the early days of probability theory and the 
moment relations of the distribution of means of such samples were determined 
by Tchebycheff many years ago. The distribution of standard-deviations of 
samples from normal material has also been fully worked out both for small 
and large samples, and actual experimental verification of the theoretical formulae 
obtained was provided by “Student” in Biometrika, Vol. vi. pp. 1—25. 

While the work indicated above has rounded off the case of large or small 
samples from a normal population, both from the point of view of their means 
and their standard-deviations, it gives us no ability to deal with the case of 
samples from material which is definitely skew in character. 

However, relations of the type : 


where ,M,’, ,M,, .M;, and =, are the mean, second, third and fourth 
moment-coefficients, first two betas and standard-deviation respectively of the 
distribution of means of samples of V, whilst 4)’, 2, fs, Bs; B,, B. and @ are the 
corresponding constants of the sampled population, follow from Tchebycheff’s 
work and enable us to obtain some appreciation of the distribution of means of 
samples of any size from any population. It is clear from them that, if the 
deviation of the sampled population from normality be not very great, the dis- 
tribution of means of samples of NV itself will not diverge widely from normality. 
In fact for 8,<0°2 and B.—3 <0 it is to be expected that, even for V=10, a 
normal curve will give a really excellent fit to an experimental distribution of means, 


id 
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As regards theoretical distributions of means of samples of NV obtained directly 
without the use of moments from a given sampled population, when this population 
is a Type III Pearson Curve the corresponding distribution of means has been 
deduced in Part II by direct integration, the sampled population being considered 
“infinite ” in all its categories *. 


The deduction of 8-relations for the distribution of o? of samples drawn from 
any sampled population infinite in all its categories is due to Tchouprofft. At 
first sight these relations place the distributions of means and o* of samples from 
non-normal material on the same footing; that is the constants of these dis- 
tributions can be computed and frequency curves to represent them can be 
determined. But as far as calculation is concerned, the two cases are on somewhat 
different footings. 'T'chouproff’s formulae are very lengthy, one of them having 
37 terms in it; but, what is much more troublesome in practice, the evaluation 
of the formula for the 4th moment-coefficient of the distribution of o? of samples 
involves a computation of the moments of the sampled population up to and 
including the 8th. Again Tchouproff’s formulae are absolute, not approximate, 
but it would be wrong to suppose that, for small and moderately small samples, 
the terms in the higher inverse powers of NV can be neglected summarily in com- 
parison with the terms in the lower inverse powers of N. For such values of the 
moment-coefticients as obtain for moderate skewness of the sampled population 
and moderately small samples, I find there is not such a rapid convergence of the 
Tchouproff terms as to warrant the general neglect of the terms in the higher 
inverse powers of NV. This is a practical illustration of the theorem developed by 
Isserlis, namely that the moment-coefficients of the distribution of o? of samples 
from an infinite population do obey, when the sample is large enough, the normal 
law, but the sample must be very large before we can look upon the distribution 
of even the 4th moment-coefficient as normal. 


Now the computation of the 8th moment-coefficient of the sampled population, 
as required by Tchouproff’s formulae, can be carried out in a special experimental 
investigation like this, but can hardly be demanded in everyday statistical 
practice. Accordingly I looked round for a method of determining the higher 
betas without direct computation and it seemed desirable to ascertain whether, 
assuming the applicability of Pearson’s general frequency curves which provide 
finite difference relations{ between successive betas, it is possible to determine 
the higher betas of the sampled population from the lower ones by these formulae 
with sufficient accuracy to obtain good fits to the experimental distributions of 
o* of samples obtained from the skew populations employed. 


* The term ‘‘infinite sampled population” has a quite simple and definite meaning for the 
statistician, although it seems to have troubled some mathematicians recently (see Burnside : Camb. 
Phil. Soe. Proc. Vol. xxu, pp. 726—7). I indicate its significance below. 

+ Biometrika, Vol. xu. pp. 193—194. For the correction of Tchouproff’s formulae in the case of 
the 4th moment-coefficient by the present writer, see Biometrika, Vol. xvi. pp. 79—83. 

t Tables of B;, 8,, 8; and Bs in terms of f; and fy are given in Tables for Statisticians, pp. 73—79, 
but as some errors occur they have been recalculated by Professor K. Yasukawa : see pp. 268—275 of 
the present issue of this Journal. 
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It is well at this point to draw attention to an important matter. The 
population from which the samples are taken will be in general itself a sample 
for a much larger population, and hence its higher moment-coefficients may differ 
from their “true” values by very considerable variations, by even 40 or 50 per cent. 
Accordingly on this ground alone we should not be justified—except for the special 
purpose of testing our formulae experimentally—in laying any marked stress on 
the exact values of these higher betas of the sampled population. 


This method of obtaining rapidly the higher betas of the sampled population 
will be shown to be successful and thus, expressing Tchouproff’s formulae in terms 
of the betas of the sampled population, we can use for the higher betas their 
values in terms of the lower ones as obtained by the difference relations. 

While formulae for the moment-coefficients of the distributions of means and 
o* of samples of NV from any sampled population may now be said to be established 
and confirmed experimentally, it must yet be remarked that they are dependent 
on the sampled population being indefinitely large in all its categories compared 
with the size of the sample. In practice this means that when any individual is 
drawn in actual sampling, the chance that the next one drawn will belong to a 
definite category of the sampled population must be quite unaffected by this 
previous “draw,” or otherwise individuals must be drawn singly and replaced 
before the next is drawn. This method is generally referred to as sampling from 
an “infinite” population. If the whole sample be drawn before replacement, the 
frequency of draws from a single category of the sampled population will no longer 
follow a binomial but a hypergeometrical distribution. This method is generally 
referred to as sampling from a “ finite” population. 

Turning now to the consideration of sampling from a finite population, the 
moment-coefficients of the distribution of means of samples of V have been 
known for some time, although it is not clear from whom they originated, whilst 
Dr J. Splawa-Neyman* has given the mean and 2nd moment-coefficient of the 
distribution of o? of such samples. However, he did not give formulae for the 3rd 
and 4th moment-coefficients of the distribution of o* of samples, which moment- 
coefficients are required if the distribution is to be represented by a Pearson or 
other frequency curve. These formulae proved very laborious and difficult to 
obtain, and when obtained were found to be extremely unwieldy and complicated, 
but they are given together with an outline of the method of obtaining them in 
the last part of this paper. 

This very considerable piece of work has apparently placed the discussion of 
the distributions of means and o* of samples of V from a finite population on the 
same basis as that given for the corresponding distributions obtained from an infinite 
population. This is more or less true for the distribution of means of samples, for 
like the corresponding distribution from au infinite population, it can be repre- 
sented by a Pearson curve obtained from the values of the moment-coefficients 


* Biometrika, Vol. xvu. p. 472. In this paper Dr Neyman also gives the moment-coefficients of the 
distribution of means of samples. 
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given by their formulae, and moreover, even when the sampled population is very 
skew and as small as 36, the distribution shows a remarkable approach to normality. 

In considering the distribution of o of samples of NV from a finite population, 
the formulae for the moment-coefficients being so unwieldy, it is found that even 
moderate-sized sampled populations must be treated as infinite; that is, Tchouproff’s 
formulae must be used in lieu of the proper formulae for these moment-coefficients, 
if the labour involved in the representation of the distribution is to be kept within 
limits that are reasonable even for an investigation like this. In practical statistics 
the labour involved would undoubtedly prohibit in all cases the use of these proper 
formulae for the moment-coefficients. 

These formulae could be used for very small sampled populations like the 
binomial considered below, but judging from this case, it is likely that the 
resulting frequency curve would in no way represent even the general trend of 
the frequency groups of an actual distribution of o* of samples obtained from the 
population. This distribution of o* of samples in the case considered proved to be 
a discrete frequency with widely varying intervals between the separate frequency 
groups, and was moreover tri-modal. This rather unprecedented and startling 
result, coupled with the unwieldiness that is certain to appear in any attempt 
to find formulae for the higher moment-coefficients of the distribution of o* of 
samples, means that, at present, the representation of such a distribution by a 
smooth frequency curve is practically impossible, if the sampled population be 
very small. 

For the true application of all this theory, too much emphasis cannot be laid on 
the fact that the samples in any actual case must be truly “random” in character. 
The recording of the individual, its return to the “ bag” and the thorough shuffling 
between each draw involve, in the case of 10,000 or 20,000 draws, a very great 
expenditure of time and labour. At the same time, in my experience and that of 
other workers in the Biometric Laboratory, it is impossible with many hundreds 
of marked tickets to ensure the returned ticket being equally likely to be any- 
where in the bag or bowl, and so equally likely to be drawn next time as any other 
ticket. In fact, working with many tickets, perfect shuffling seems a practical 
impossibility. 

However, when the sampling is conducted on the basis of an infinite sampled 
population, the most satisfactory method seems to be to procure one good random 
sample, such as that obtained by Mr L. H. C. Tippett, and to apply this to all 
samplings of this character. This method, which is described later, has also the 
great advantage of comparative speed and ease of operation, and I wish at this 
stage to thank Mr Tippett for the loan of his set of random numbers, for it is 
undoubtedly a boon when sampling from an infinite population. 

When the sampling is made on the basis of a finite sampled population, 
however, such a set of random numbers is at present of no avail. Consequently, 
whilst discussing the distributions of means and o* of samples of N from such 
populations, material other than tickets was used in order to find a way of 
minimising the difficulties of shuffling. 
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When the sampled population, though finite, is large, quite approximately 
symmetrical and homogeneous material like small commercial coloured glass beads 
has been found to overcome the difficulties of shuffling to the extent necessary to 
produce good agreement between theory and practice. When however the sampled 
population is quite small, satisfactory results can only be obtained by using material 
like coloured marbles, which approximate very closely indeed to the obvious ideal 
of complete symmetry of shape and homogeneity of size and weight. 


Turning now to the consideration of the experimental material used, it should be 
remarked that the number of significant figures given for the moment-coefficients 
is only adopted in order to obtain the higher betas with the accuracy needful for 
their use in Tchouproff’s formulae*. They can, of course, be dropped as soon as 
the two betas of the distribution of means or of second moments of samples, as the 
case may be, have been reached. The two sampled populations used in the dis- 
cussion of samples from an infinite population are the following: 


(A) Barometric Readings taken over a period of eleven Years at Laudale in Scotland. 


Total Frequency 4011. 


| 
28-2 28°3 | 28°4 | 28°5 


Central Bar. 
Height in ins. 


28°6 | 28-7 | 28-9 | 29:0 29:2 | 
|— 


| | 
Frequency 0 2 2; 41 8 12 | 32 | 47 | 72 | 87 | 125 


| 
29°5 | 29°6 | 29°7 | 29-8 | 29:9 | 80°0 | 30°1 | 80°2 | 30°3 | 30°4 | 30°5 | 30°6 | 30-7 | 30°8 


| 
| | 28-0 | 28-1 


| 
| 


Central Bar. 


Height in ins. | ~ 


255 | 304 | 346 394 | 380 400 | 361 | 376 | 266 | 182 


| | | 


Frequency 187 


94-| 50 21 2 


The Constants for this distribution were found to be as follows: 


[Working Origin 29°7 ins. of Bar. Height. Working Unit 0°1 in. of Bar. Height.] 


y= 1°580,155, v3 = 1,416°561,456, 
= 17:255,547, 57,788'534,530, 
49°231,613, =— 155,697-948,891, 
759°882,324, vs = _7,868,534°826,228. 
Mean = 29°858,0155 ins. of Bar. Height. 
= 14°758,657, = 70,469°386,836, 
v, =— 24676,758, vy =— $§18,237°881,078, 
688°517,798, vg = 13,875,641°654,318, 
Vs = — 3,999°253,754, o =0'384,1700 in. of Bar. Height. 
B, = 0°189,424, = 3°160,978, 8, = 2:080,078, 
B, = 21°920,980, Bs = 292°459,489, 


The Skewness is ‘237. 


* Some of the powers and products of moment-coefficients have to be multiplied by numerical 
coefficients of the order of 1300. 


| i 
= —| —— 
| 
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(B) A Graduated Frequency Distribution obtained from a smooth Frequency 
Curve given in Biometrika, Vol. x1. p. 400. 


Total Frequency 10,000. 


> 


Frequency 1 | 2 4 8 15 | 27 47 78 | 125 | 191 | 282 | 399 | 541 
| 
| | 
Group | 5°93 | 8 | j | 0] 
Frequency 700 | 861 | 1005 | 1101 | 1127 | 1064 | 910 | 691 | 451 | 241 | 98 | Q7 1 
The Constants for this distribution were found to be: 
18518, 2,274°2528, 
Ve = 16°1684, 35,855°6624, 
= 55°8272, = 95,052°5432, 
vy, = 6285896, ve = 2,676,103°2296. 
Mean = 1°8518. 
Vy = 12°739,237, Ve = 38,480°027,968, 
=— 21:294,481, v, =— 329,631°122,867, 
512°453,377, ve = 4,3868,582°223,120, 
v; = — 2,571:063,549, =3°'569,207. 


B, = 0°219,333, 
A, = 18°612,518, 
The Skewness is ‘263. 


B, = 3°157,676, 
Bs 


B; = 2:078,769, 
= 165°869,619. 


Since for the purpose of sampling the material must be arranged in groups 
and the individuals of one group all given the same class-index, it is not only 
unnecessary, but even erroneous, to apply Sheppard’s grouping corrections to the 
moment-coefficients, before using them to determine the distributions of means and 
squared standard-deviations of samples. Consequently in the preceding values, and 
throughout this paper, the corrections for grouping have not been used, the betas 
being calculated directly from the raw moments (v’s) transferred to the mean. 


The sets of samples taken from these two populations (A) and (B) are indicated 
in the following, the constants of the corresponding distributions of means and 
squared standard-deviations (second moments) of samples being given at the 


same time. 


I. 1000 Samples of 10 from Population (A), the sampling being on the basis 
of an “infinite” population. 


Method of Sampling, that of drawing single tickets with return from a bag. 


Bar. 

eee in 

| | 

| | 

| Bar. 

in 

| 

| | Fre 

| 
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(a) Frequency Distribution of Means of Samples. 
Total Frequency 1000. 


ine 29°275- | 29°325- | 29°375- | 29°425- | 29-475- | 29°525- | 29°575- | 29°625- | 29°675— | 29°725- | 29°775- 


Frequency 1 1 0 7 9 18 22 52 83 

| 


125 132 
a — 29°825- | 29:875- | 29°925- | 29-975- | 30-025- | 30-075- | 30°125- | 30-175- | 30°225- | 30°275- | 30°325- 
| 
Frequency | 158 145 101 638 | 387 29 13 3 0 0 1 
The Constants were found to be: 
[Working Origin = 29°8 ins, of Bar. Height. Working Unit = 0:05 in. of Bar. 
Height. } 
0741, Mean=  29°837,050 ins. of Bar. Height. 
ve = 8149, i= 7599,919, o = 0'137,840 in. of Bar. Height, 
vy, = 14367, Vs =- 2°934,489, 8, = 0°019,617, 
y, = 213-217, vy, = 196°575,508, B, = 3°403,388. 


The Skewness is ‘057. 


(b) Frequency Distribution of o's of Samples. 
Total Frequency 1000. 


| Value of 


| | 35-| | 55-| 7-5- | 85- | 9:5- | 10°5- | 11°5- | 19°5- | 18°5- 

| Frequency| 1 3 9 18 25 39 48 60 56 64 51 71 52 

| 

ly 

of 15°5-| 16°5- | 17-5- | 18°5-| 19°5- | 20°5- | | 22-5- | 23-5- | 24-5- | 25-5- | B6-5- 

Preven 6 | 47 | 53 | 46 | 32 | 40 | 43 | 19 | 25 16 | 16 | 14 | 16 | 

| Value 

Value of | | 98-5- | 29°5- | 30°5- | 31°5- | 32-5- | 33-5- | 34°5- | 85-5- | 36-5- | 37-5- | 38-5- | 
| 

| Frequency 13 5 9 7 3 2 5 3 0 4 2 2 a 

| 

| of | | 41-5- | 4a-5- | | 44°5- | 45-5- | 46-5- | 47-5- | 48°5- | 49°5- | 50°5- | 51-5- | 59°5- 

eee 1 1 1 1 tv) 0 2 1 1 2 2 0 3 | 


The Constants were found to be: 
[Working Origin = 14.] 


y= 1:848, Mean = 15°848, B, = 2°149,615, 
= 64°578, Vy = 61°162,896, = 6'°560,572, 
= 1,046°712, Vs = %01°313,792, o =7°820,671. 
= 380,991°482, Vs = 24,542°441,607, 


The Skewness is *643. 


Bag | | 
| 
| 
a 
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II. 1000 Samples of 10 from Population (A), the sampling being on the basis 


of a “finite” population. 


Method of Sampling, that of drawing tickets from a bag, complete samples 


being drawn before return to bag. 


(a) Frequency Distribution of Means of Samples. 
Total Frequency 1000. 


Bar. Height | 99. 445_ | 29-485- | 29°525- | 29°565- | 29-605- | 29°645- | 29°685- | 29-725- | 29-765-| 29°805- 
Frequency 1 4 11 16 | 21 37 69 94 128 120 
29°845- | 29°885- | 29:925-| 29°965- | 30°005- | 30-045- | 30°085- | 30°125- | 80°165- | 30°205- 
Frequency 115 129 89 77 45 26 11 5 1 1 


The Constants were found to be: 


[Working Origin = 29°825 ins. of Bar. Height. Working Unit=0-04 in. of 


Bar. Height. ] 


v= 0475, Mean= 29°844,000 ins. of Bar. Height, 8, =0°013,218, 
9459, = 9233,375, B, = 2°907,286, 
vs = 10°039, Vs =— 3225,731, o =07121,546 in. 
v¢ = 254283, Vs = 247°861,301, of Bar. Height. 


The Skewness is ‘063. 


(b) Frequency Distribution of o*’s of Sumples. 
Total Frequency 1000. 


| 
Frequency| 2 4 | 5 | 19 24 35 47 56 83 58 82 65 
Value of | 12-5 | 13°6- | 14-5-| 15°5- | 16°5-| 17-5-| 18-5-| 19°6-| 20°5- | 21°5- | 22°5-| 25-5- 
| | 
Frequency) 54 | 67 | 57 | 63 | 44 | 34 | 33 | 19 | 26 | 17 | 2 |] Un 
| 
| 
of | 25:5-| 26-5-| 27-5- | 28-5- | 29°5- | 30°5- | 31-5- | | 33-5- 34°5- | 35-5- 
| 
Frequency, 10 | 7 9 7 6 2 4 6 3 1 1 2 
| 
| 
of | 96-5_ | 37-5-| 38-5- | 39°5- | 40°5- | 41-5-| 49°5- | 43°5- 
Frequency| 1 3 1 | 1 1 0 0 0 1 | 


4 
| 
| | 
| 
| 
| 
ge 
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The Constants were found to be: 


[Working Origin = 12.] 


«£916, Mean= 13-915, By = 1'113,857, 
46419, v, = 42°751,775, =4°684,768, 

550-908, vy, = 298271,317, = 6°538,484. 

= 11,801°307, y, = 8,562-417,905, 


The Skewness is °524. 


III. 1000 Samples of 10 from Population (A), the sampling being on the basis : 
of an “infinite” population. 


Method of Sampling, that by aid of Tippett’s Table of Random Numbers*. 


(a) Frequency Distribution of Means of Samples. 
Total Frequency 1000. 


Bar. Height 


29°425— | 29°475— | 29°525— | 29°575- | 29°625- | 29°675— | 29-725— | 29°775- 


Frequency 2 3 7 20 44 67 109 158 


Bar. Height 


29°825— | 29°875— | 29°925— | 29-975- | 30°025- | 30°075- | 30°125- | 30°175- 


Frequency 146 163 123 85 A4 20 8 1 


The Constants were found to be: 


(Working Origin = 29°85 ins. of Bar. Height. Working Unit = 0°05 in. of Bar. 


Height. ] 

y= 0063, Mean= 29°853,150 ins. of Bar. Height, 8, = 0°019,694, 
5889, = 5°885,031, B, = 2°929,922, 
vi=— 0891, » =— 2°003,521, o =0°121,295 in. 
y= 101109, » = 101°473,726, of Bar. Height. 


The Skewness is ‘075. 


* A description of this method is given at the end of this part, 


Biometrika xv111 22 
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(b) Frequency Distribution of o°’s of Samples. 
Total Frequency 1000. 


| | | | 
of | | 3-5-| 55-| | 8-5-| 9°5- | 10°5- | 11°5- | 12°5- | 13°5- | 14°5- 
| | 
Frequency! 2 10 | 17 | 27 | 45 | 57 | 55 | 58 | 7 | 73 | 79 | 63 | 48 | 62 
| 
heey of | 15-5-| 16°5- 186- 19°5- | 20°5- | 21°5- | 22°5- | 23°5- | 2h-d- | 5 5-| 26:5-| | 
| | 
| | 
\Frequency| 59 | 42 | 43 | 29 | 94 | 27 | 15 | 13 | 15 | 13 | 11 6 | 4 | 4 
of | 99.5_| so-s-| 995- | 34-5-| 35°5- | 365-| 38-5- | 39°5- 40°5~ 41°5- | 42°5- 
| 
\Frequency| 5 | 1 3 | oe | 6 | 2 2 1 | 1 2 1 0 1 | 0 
| | 
| | 
Value of 43°5— | | 9°5— | 50°5— | §1°6— | | | | 55°5. 
| | | | 4B°5— | | 50°5— | 51°5— | 52°5- | 53°5- | 54°5- | 55°5- 
| — | | | 
|Frequency| O | 0 | 0 0 1 0 ) 1 0 2 0 0 1 


The Constants were found to be: 
[Working Origin = 12.] 


17 


25, 


48-805, 
748269, 
22,213°549, 

The Skewness is 


Mean = 


Vo = 


Vs; 


Vy 


13°725, 
45°829,375, 


505°969,031, 


= 17,895°282,136, 


B, = 2°659,598, 
B, = $520,217, 
o = 6°769,732. 


IV. 1000 Samples of 10 from Population (B), the sampling being on the basis 
of an “ infinite ” population. 


Method of Sampling, that by aid of Tippett’s Table of Random Numbers. 


(a) Frequency Distribution of Means of Samples. 


Total Frequency 1000. 


| ~249- | _1-99-| 1°49- —0:99- | — 0°49- | 0°01- | 0°51- | 1°01- | 1°51- 
Frequency I 2 | 6 16 | 44 53 | 129 | 152 | 188 
201 2°51- | 3-01- | 3-51- | | 4°51- 5-01-| 551 
| Frequency | 78 33 15 i) 1 


4 
a 
Vv 
— 
Fr 
V 
| 
| Fr 
— 
— 
y, = 

| 
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The Constants were found to be: 
[Working Origin = 1:76. Working Unit = 0°5.] 


vy, =— 0011, Mean= 1°7545, 8, = 0'010,318, 
65°81, Vs = 5:080,879, 
vy; =— 1°331, Vs =— 1163,329, o =1127,041. 
79°961, = 79°906,126, 


4 
The Skewness is 048. 
(b) Frequency Distribution of o*’s of Samples. 
Total Frequency 1000. 


of os- | | | 55-| 6°5- | | 95- | 10°5- | 11°5- | 18°5- 
| 
| 
Frequency! 1 1 18 | 34 | 66 | 67 | 74 | 80 | 62 | 85 | 71 | 61 | 67 | 51 
| | 
of | 15-5-| 16-5-| 17-5-| 18-5-| 19-5-| 20-5- | 91°5- | 29-5- 23-5- | 25°5- | 26-5- | 27-5- 
| 
Frequency] 48 | 34 | 33 | 34 | 96 | 92 | 8 14 | 7 7 | 5 4 2 5 : 
of | 28-5_ | 29:5-| s0-5-| 31-5-| | 35-5- | 36-5- | 87-5-| 38-5- | 39°5- | 40°5- 
| 
Frequency| 0 5 1 2 2 | 1 1 0 | Oo 0 0 0 1 
| 


The Constants were found to be: 


[ Working Origin = 10.] 


1609,  Mean= 11-609, B, = 1:136,733, 
34798, = 32204119, =4683,644, 
vi = 354463, », = 194848,209, = 5674,867. 


= 6,618°413, 
The Skewness is ‘539. 
V. 1000 Samples of 10 from Population (A), the sampling being on the basis 
of a “finite” population. 
Method of Sampling, that of drawing coloured beads from a bowl, complete 
samples being drawn before return to bowl. 


4,857°431,837, 


= 
ll 


(a) Frequency Distribution of Means of Samples. 
Total Frequency 1000. 


| Bar. Height | 99.05 | | 99-525-| 29-575- | 29-695- | 29-675-| 29-725-| 29-775- 
in ins. | 
| Frequency 2 | 2 7 | 25 | 34 67 92 136 
| | 
— | 29°825- | 29°875- | 29925- | 29-975- | 30-025-| 30-075- | 30°125- 
| 
| Frequency | 170 154 127 | 84 | 54 29 | 13 4 
| 
222 


\ 
e 
| 
| 
| 
\ 4 
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The Constants were found to be: 


[Working Origin = 29°85 ins. of Bar. Height. Working Unit = 0°05 in. of Bar. 
Height. 


y= 0274, Mean = 29°863,700 ins. of Bar. Height, 
ve = 6440, Ve = 6°364,924, o = 0'126,144 in. of Bar. Height, 
vy; = 3664, vy, =— 188,538, B, = 0:009,786, 
= 122°516, Vs = 121°384,282, B, = 2°996,236. 


The Skewness is ‘050. 


The distributions of means and o? of samples from the third population (C) are 
given, with that population, in Parts IV and V. 


Tippett’s Random Sample. 


Tippett’s Table of Random Numbers consists of 10,00 random numbers, the 
method of using them as an aid to sampling being as follows. The numbers range 
from 0000 to 9999 and, before using the table, each individual of the population 
is given one of these numbers. The population therefore must not be greater 
than 10,000 but generally it will be considerably less and consequently only 
certain of these numbers will be required. Suppose one category of the sampled 
population consists of 55 individuals, then these 55 individuals may be given the 
numbers say 0010—0064, and on traversing the table in any systematic manner 
(down columns or across rows for example) whenever one of these numbers is met 
an individual of this category is considered drawn. All categories of the original 
population are similarly dealt with and the table is traversed until the requisite 
number of “draws” has been made. For example, if 500 samples of 5 are required, 
2500 “draws” must be made and these put into groups of 5 in any systematic 
manner. It may be, as in the present work, that the number of “draws” required 
to complete the sampling has not been attained when the table has been com- 
pletely traversed. In this case, the requisite number of extra “draws” must be 
obtained, either by giving the individuals of the sampled population a new set of 
numbers out of the 10,000 and traversing the table as before, or by retaining the 
numbers already given and re-traversing the table in a different order. 


The advantages of Tippett’s Random Sample are firstly, that its randomness is 
reasonably established and consequently any set of samples obtained by it may be 
considered a random set, secondly that it is very much speedier in application 


than the ordinary method of drawing tickets or other material from a bag, bowl, 
or urn. 


Its disadvantage is that it necessitates the individuals of the sampled population 


being drawn one at a time, that is the sampling must be conducted on the basis of 
an infinite sampled population. 


— 
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When it is definitely desired to draw the whole sample at a time, i.e. a sample 
on the basis of a “finite” sampled population, the method of drawing objects 
from a bag or an urn must be employed. The difficulties of shuffling when the 
objects are tickets is well known, so in this work samplings made on this basis 
have been taken both by drawing tickets from a bag and also by drawing more 
symmetrical and easily shuffled material. This material was, in one case a large 
number of small commercial coloured beads whose homogeneity of size and shape 
was rough, and in the other case a small number of coloured marbles which 
approximated much more to the ideal of complete homogeneity. This ideal of 
complete homogeneity in the individuals representing the sampled population is 
well known to be necessary in theory and desirable in practice ; but in the past, it 
has scarcely been recognised how necessary as well as desirable it is in practice, 
especially when sampling from small populations. 


Part If. THe DistrRinutTion oF MEANS OF SAMPLES TAKEN FROM 
AN “INFINITE ” POPULATION. 


(a) As mentioned in the introduction the relations known to exist between 
the moment-coefticients of the Distribution of Means of Samples and the corre- 
sponding constants of the Sampled Population indicate that, when NV, the number 
in the sample, is large, the Distribution of Means should be well represented by a 
Normal Frequency Distribution. When WV = 10, although a similar result may be 
expected, it is not so clear that a close approach to normality will be obtained 
in practice. Consequently it was considered of importance to fit a normal curve to 
the three experimental distributions (I(a), I11(a) and IV (q@)) employed in this 
portion of the work. 


The results were as follows: 


Distribution I (a). 
Bar. Height Normal Pcie Bar. Height Normal anor 
in ins. Frequency in I (a) in ins. Frequency in I (a) 

29°275— 0°0730 1 29°825— 144°0537 158 
29°325— 0°2813 | 1 29°875— 13074000 145 
29°375— 09491 0 29°925— 103°4294 101 

29°425— 2°8079 7 29:975— 7179655 6: 

29°475— 7°2799 9 30°025— 43°8715 37 
29°525— 16°6427 18 30°075— 23°4632 29 
29°575— 32°8911 22 80°125— 10°9927 13 
29°625— 56°6393 52 80°175 4°5189 3 
29:°675— 89°2672 83 30°225— 1°6269 0 
29°725— 118°5678 125 30°275— 0°5140 0 
29°775— 139°5584 132 30°325— 071423 1 


Testing for Goodness of Fit with 17 Groups, this gave quite a fair fit with 
20-41 and P =-2031. 
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Distribution ITT (a). 

Bar. Height Normal Bar. Height Normal 

in ins. | Frequency in III (a) in ins. Frequency in III (a) 

| | 09037 2 29°825— 163°2387 146 
29°475— 2°5000 3 29°875— 151°7139 163 
29 °525— 7°5084 7 29°925— 1192535 123 
29°575— 19°0701 20 29°975— 79°2800 85 
29°625— | 40°9641 44 30°025— 44°5745 44 
29°675— | 74°4157 67 30°075-- 21°1937 20 
29°7295— | 114:3247 109 30°125— 8°5221 8 

| 29°775— | 148°5447 158 380°175— 2°8981 | 1 


Testing for Goodness of Fit with 15 Groups, this gave an excellent fit with 
== 7:33 and P = -9197. 


Distribution IV (a). 


| N 1 Actual | Actual 
Value of Mean | 5, basta Frequency } Value of Mean | seote | Frequency 
requency in IV (a) Frequency in IV (a) 
—2-49— 0°3633 1 2°01— | 1590085 157 
-1:99— 1°5499 2 2°51— | 118°6741 121 
—1°49— 5°4475 6 3-01— 72°9834 78 
— 0:99— 15°7701 16 36°9808 33 
-0-49— 37-6160 44 | 15°4375 15 
— 73°9223 53 5°3100 4 
0°51— 119-6895 129 5°01— 1°5044 0 
1‘01— 159°6869 152 | 0°3512 1 
1°51— 175°5419 188 | 
| 


Testing for Goodness of Fit with 14 Groups, this gave quite a good fit with 
x? = 11°42 and P =°5758. 

It is apparent from these three values of P that the Distribution of Means of 
Samples approaches normality in practice very closely indeed, even when N is as 
low as 10, provided the original population is not of an extremely skew character. 
In fact, for practical applications, the assumption that the Distribution of Means 
is a Normal Distribution would be quite justified and would lead to quite satis- 
factory results, except possibly when N was very small indeed, say 2 or 3, or if the 
sampled population was extremely skew, WN being still rather small. 


(b) However useful the assumption of normality may be when dealing with 
the distribution of means of samples in purely practical cases, it is highly desirable 
that a curve for this distribution should be obtained which shows its skewness, 
slight as this may perhaps be, and which represents generally the distribution 
from a theoretical point of view. 

To this end it was hoped that, commencing with the various Pearson Frequency 
Curves, the equations of the corresponding Curves of Means of Samples of V 
might be obtained directly without the use of the method of moments, but success 
resulted only in the case when the sampled population was of Type III. 


| 

| 

| 

| 

| 

q | 
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The solution of this is now given, but as Populations (A) and (B) are both 
represented by Type I curves, the result could not be applied to either of them 
with the expectation of any reasonable accuracy of representation. In point of 
fact, as the betas of (A) and (B) are such that the points representing them 
in the usual diagram, although distinctly within the Type I area, are in the 
neighbourhood of the Type III line, this application was actually attempted by 
treating the populations as if they were of Type III. As was to be expected 
however, the Distributions of Means of Samples so deduced showed no agreement 
whatever with Distributions I(a) and III(a) or IV(a), and consequently the 
method is not considered worthy of further consideration here. 


The Theoretical Curve of Frequency of the Distribution of Means of Samples 


of N drawn from an “infinite” sampled population which is represented by a 
Type III Pearson Curve. 


The equation y=y,e-% (1 +2)" of the Type III curve can be written 


y=ye?* «, where p=ya, by transforming the variate to #=1 +=. Here the 
origin has been changed so that the variate is always positive and the unit in 
which it is measured is : times the original unit. 

Now the probability that the values a, ...ay of the variate #, forming a 


sample of V from the above population, lie between and a, + da,, a, and + 
ty and is 


Constant (a, 22... dx, dias... day. 
Changing the variables to 


X, = 
X;= 


we have J = 1, and this expression becomes 


Constant (X, — X, —X, —... Xy)P dX, dX... dX y. 
Putting C, = X,-— X,—...— X,y and integrating for X, between 0 and C, the 


expression becomes 
Constant Xy)°dX,dX, ... dXy | Xp 
which, putting Y,= C,Z,, becomes ; 
Constant e?*: (X,X,... Xy)? dX, dX,...dXy.C”. B(p+1, p+ 1), 
that is 
Constant (X, — X;—... (X,X,... dX,...dXy. 
Thus the result of this integration with regard to X, is to leave the form 


of the expression unaltered except that the power of the difference factor has 
been raised by p+ 1. 


: 


336 Means and Squared Standard-Deviations of Small Samples 


Therefore repeating the process V —2 times to eliminate all the X’s except X,, 
the expression becomes 
Constant X,VetN-1 e-e% dX,. 


But if mis the Mean of the Sample, Nm =X,, therefore the probability that 
the value of the Mean of a Sample will lie between m and m + dm is 
Constant m —1 dm, 
and the Frequency Curve of Means of Samples is of the form 
Y=Yo mN (p+) e-Npm, 
To determine y% let n be the total frequency of this Distribution of Means of 
Samples, that is let n be the number of samples taken. 


Then n= | ydm 
0 


=Y mN New dm, 
0 
and putting Npm =z, 
= (p+ 0) 
Pp 
Yo= 
P(N (p 
Thus the curve of frequency of Means of Samples of V drawn from a sampled 
population represented by a Type III curve is 
n (Np) (p+) 
(p+ D} 


where n is the total frequency of the distribution. 


therefore 


MN (p+1) -1 e~Npm | 


We now turn to the constants of this Distribution of Means of Samples. 


With the notation employed previously for the constants of the Distribution of 
Means of Samples of NV, and n being the total frequency, we have 


n.,M, = e~Npm dm 
0 


ZN (p+1) 


(NpyW where z= Npm 


{N(p+1)+ 1} 
(p+1)+1”” 
,_ 1 P{N(p+1)+]} +1 
Th fore M, 


he 
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Thus in the original unit 


P Y 
which shows that here, as is necessary, the mean of this distribution is the mean 
of the original population. 


Again n. iM = | e-Npm +1 dm 
0 


© g-2 gN(pt+1)+1 
=Y% (p+1) +2 dz 
0 )+ 


P{N(p+1)4+2} 
Np 


Therefore = (p +1) +1} 
N*p? 
Pp 


Therefore LM, (= = 1 


N 
M,= in the original unit 
N° 

N 


i.e. 
- in terms of the standard-deviation of original 


population. 
Similarly 


N* p* 
1 2 1 
but —3.,M, +2 (My. 
Therefore iM, = 


in the original unit. 


Again, in a like manner, 


N*p* ? 
but iM, = — 4.,M) ..M; + 6 — 3 
1 6(p+)) 


i 
i.e. MM, 


in the original unit. 


| 
| 
2(p+l1 
ie 
H 
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4 
M. 6 
d 
an iB, 3+ 
and thus 2,B,=6+3,B,, 


showing this curve to be of Type III. 
Again, the betas of the sampled Type III population were 


= 4 ~ 6 
B, pti B, =3+ 
and therefore from this we have 
B, 3 


showing that this curve satisfies all the conditions required of a curve of frequency 
of means of samples. 


Thus, for a sampled population of Type III, we have a complete theoretical 
representation of any distribution of means of samples taken from it. 


(c) At present, however, the determination of frequency curves for the distri- 
butions of means of samples from populations of types other than Type III is 
restricted to the intermediary use of the method of moments. 


The sampled population and its moments being known, the formulae given in 
Part I can be used to determine the moments of the distribution of means of 
samples and hence, when the frequency type is determined, the Pearson Frequency 
Curve representing this distribution can be obtained. To determine the frequency 
type speedily, consider a pair of rectangular axes 8,=0, 8, =0 (as in Tables for 
Statisticians, XXXV, p. 66) and represent any distribution by the point (8, A.), 
whose Cartesian coordinates, with regard to the given axes, are its betas. Now 
considering the two points (8,, B.), (By, :B:) representing sampled population and 
distribution of means of samples respectively we have, from Part I, the following 
relation between their coordinates : 


_ _ 
B, iB 1 
where 2X is independent of NV the number in the sample, that is \ is a constant for 
all distributions of means of samples derived from the original population. 
Therefore, given the point A, (B,, B,), the point B, (,B,, ,B.), lies on the line 
,B, = + 3, 


which passes through the Gaussian point and the given point A, (8,, 8.). 


r, 


B, GB _ 
N’ GA 
necessarily, the point B, i.e. (,B,, ,B,), lies on the line considered between G and A, 
and Tables for Statisticians, XX XV shows the frequency type immediately. 


Now since ,B,= we have from the diagram W” and since WV is >1 


J 
= 
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It may be noted that, if the original population be any of Types [, II, III, IV 
or VII and its betas are of moderate size, the frequency type of the distribution 
of means of samples is the same as that of the sampled population, agreeing with 
the result obtained in Part II (b). 


B.=0 


(0,3) @ § 


A Bo) 


This method was applied to Populations (A) and (B) in order to determine the 
degree of accuracy with which the actual Distributions I (a), III (a) and IV (a) are 
represented by the corresponding theoretical curves obtained in this manner. 

Population (A) has for its constants : 


Mean = 29°858,0155 ins. of Bar. Height, 
@ = 0°384,1700 in. of Bar. Height, 
B, =0°189,424, 8, =3'160,978, 
and its frequency type is Type I. Therefore the Distribution of Means of Samples 
of 10 is represented by a Type I curve and, from the equations of Part I, its 
constants are : 
Mean = 29°858,0155 ins. of Bar. Height, 
>, = 0°121,4852 in. of Bar. Height, 
= 0-018.942, ,B, = 3:016,098. 
With the usual notation, the fitting of these constants to a Type I curve gives 
r = 486°517,905, = 37,463°612,112; 
the quadratic is m? — 486°517,905m’ + 37,463°612,112 = 0, 
its roots are 390°606,521 and 95°911,385. 

Now as the », of Population (A) is negative, the ,M/, of this distribution is also 
negative and, both the roots of this quadratic being positive, m,’ is the larger root 
and therefore m, = 389°606,521, m, = 94°911,385. Using 0°1 in. of Bar. Height as 
working unit, b = 67°423,715 and a, = 54°216,202, a, = 13°207,422. 


N mm" (m, + m, + 2) 
(m+ my™*" T (m, +1). T +1)’ 
therefore log = 2°516,5925, 
and Yo = 3285432. 


The frequency curve representing the Distribution of Means of Samples of 10 
from Population (A) is thus 


389°606 ,521 1 


B B. lol B 2) 
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The Mean is 29°858,0155 ins. of Bar. Height and 


The Plotted Ordinates of this Curve, which is shown in Fig. I, are 


Mean-Mode = — 0:008,4290 in. of Bar. Height. 
Therefore the origin, which is the mode, is at 29°866,445 ins. of Bar. Height, and 
the range is from # = — 54°216,202 to « =13-207,422. Now the actual distributions 
of means of samples derived from Population (A) range from 29°275 ins. to 
30°325 ins., so it is sufficient to plot this curve from «= — to w= 45. 


x y y y 
-5°0 0°2390 —1:5 159°736 0°125 326°837 25 31°2449 
| 0°8729 -—1°0 237°122 0°25 321°595 3°0 10°2027 
-4°0 2°8415 —O°75 273°029 0°50 301°400 2°5745 
—3°5 8°2122 —0°50 302°381 O75 270°046 0°4694 
-3°0 20°9904 — 0°25 321°724 10 231°044 4 00686 
47-2366 —0°125 326°809 1°5 146°257 
— 2:0 93-0828 0°00 328°543 20 75°4552 
<8 
Llu 
= 
| \ 
HISTOGRAM OF 
| DISTRIBUTION IIl@). 
| ([] = FREQUENCY 2. 
L100 
T 
29-425 29-866 30-225 
29-858 


Means of Samples in ;, inch of Barometric Height. 


Distribution of Means in 1000 Samples of 10 from the Skew Distribution of Barometric Heights (A). 
Fitted Curve from theoretical values of o, 8, and 8, for curve of means. 


Nee 
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Again, Population (B) has constants 
Mean = 1°8518, & = 3°569,207, 
B, = 0°219,333, = 3°157,676, 
and its frequency type is Type I. Therefore the curve representing the distri- 
bution of means of samples of 10 is of Type I, with constants 
Mean = 1°85818, >, = 1:128,682, 


= 0°021,933, = 3°015,768. 


We have, as before, 
r = 349°152,438, e = 20,554°430,847, 
m, = 273:187,591, M, = 73°964,847, 
b= 51:434,693, q= 40°476,020, 
10°958,780, y=  353°5420. 


The frequency curve representing the Distribution of Means of Samples of 10 
From Population (B) is thus 


273°187,592 1 73°964,847 
y = (1 ( 10955750) 


The Mean is 1°858,18 and Mean-Mode = — 0:084,541. Therefore the origin, 
which is the mode, is 1°942,721, and the range is from #=— 40°476,020 to 
x =10°958,780. Now the actual distribution (IV (a)) of means of samples of 10 
derived from Population (B) ranges from —2°5 to 60 units, so it is sufficient 
to plot the curve from # = — 4°5 to # = 45, 


The Plotted Ordinates of this Curve, which is shown in Fig. IT, are 


} | 
x | y x y x y % y 

| 
0°4137 | -1-0 242-950 0°25 344°913 3-0 5°6736 
| 15782 | | 285°652 0°50 319°821 3°5 1-0692 
| 52649 | 322°369 281-500 40 0°1385 
| 152917 | -0°25 345°082 1-0 234°611 00118 
-25 | 38°4765 -0:125 | 351:396 15 137°218 
83°3704 0:00 | 353°542 2-0 62°7644 
—1°5 154°493 0°125 | 351°382 2:5 21°9485 


Tests for Goodness of Fit. 


With the obvious exception of the tests for normality, the data for all tests 
of Goodness of Fit of the theoretical curves against the corresponding experimental 
distributions were obtained by first drawing the curves accurately to a large scale 
on prepared cardboard, and then planimetering the areas corresponding to the 
frequency groups of the experimental distributions. This is the most satisfactory 
method in practice, but is somewhat lengthy and laborious. 
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In this case, the tests for Goodness of Fit gave 
(1) For theoretical curve of means of samples of 10 from Population (A) against 
experimental Distribution I (a), ie. ticket drawing : 
15 Groups. x? = 8407. P = 0000. 
(2) For theoretical curve of means of samples of 10 from Population (A) against 
experimental Distribution III (a): 
16 Groups. 01. P ='9573. 
(3) For theoretical curve of means of samples of 10 from Population (B) against 
experimental Distribution IV (a): 
16 Groups. = 18°70. P ='2284. 


FIC. I. 


4200 HISTOGRAM OF 
DISTRIBUTION IZ @). 


[_]= FREQUENCY 2. 


100 


7 


-9.49 1943 
1-858 


Means of Samples in units of original Distribution (B) with sub-range 4 unit. 


Distribution of Means of 1000 Samples of 10 from the Smooth Skew Distribution (B). 
Fitted Curve from theoretical values of o, 8; and f» for curve of means. 


A. E. R. Cuurcu 343 


On the whole these theoretical curves of means of samples do not give as good 
fits to the actual distributions as did the Normal Curve. There is little in this 
beyond the fact that for mere purposes of representing distributions of means of 
small samples, the Normal Curve will give in all probability as good or better 
results as any other method, and therefore may be preferred often in practice on 
account of its relative simplicity. 


In the two cases when the sampling was carried out by Tippett’s Random 
Sample, that is when the randomness could be considered established, the fits to 
theory were P ="9573 and ‘2284; consequently we may conclude, fairly, that the 
Pearson Frequency Curves employed here as a result of the equations 

do represent adequately actual distributions of means of samples provided the 


assumption of an “infinite” sampled population be made and the randomness of 
the sampling is reasonably certain. 


It is fairly clear from this, that the poor fit of Distribution I(@) must be due 
to lack of effective shuffling of the tickets, although great care was employed in 
the attempt to ensure this shuffling was effective and the resulting samples to be 
of a quite random character. This difficulty is quite well known of course, but 
can be overcome to a great extent by the use of beads or similar material instead 
of the tickets. This point is dealt with in Part IV. 


Part III. Tue Disrrinution oF SECOND MOMENTS (¢*) OF SMALL SAMPLES 
FROM ANY “INFINITE” POPULATION. 


(a) Commencing with the various Pearson Curves as types of sampled popu- 
lations, many attempts were made to deduce without the use of moments the 
corresponding theoretical frequency curves of o* for samples. These were not 
successful, so recourse was necessarily taken to the method of moments. 


In Biometrika, Vol. x11. p. 198 (23), p. 194 (24) and (25), Professor Al. Tchouproff 
has given formulae for the moments of the distribution of o* of samples from any 
“infinite” population in terms of the moments of that population. In Biometrika, 
Vol. xvii. pp. 79—83, I have obtained these formulae afresh by a shorter method 
and have corrected an error in the expression for the 4th moment of the distri- 
bution of o? of samples. This investigation was carried out, because Professor 
Tchouproff’s results followed as a result of some very intricate algebra, whilst on 
close inspection the expression for the 4th moment was seen to be in error, so it 
was considered of some value, not only to correct the expression for the 4th 
moment, but also to check the others and to deduce all of them by a simpler 
analysis more in conformity with English statistical methods. 


‘ 
| 
= 
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The formulae*, thus corrected, for the moments of the distribution of o? of 
samples are 


1 
= — 30,0, — 60,? + 27,3] 
[30, — 21,0, — + + 330d, — 220,? + 540.5] 


M, = + — 45,5, — 249.5, — 155, + 485,5, + — 3074] 
— — 405.9, — 967.3, — 545, + + 5289.29, — 3067, 
+ [ 60, — 96,0. — 1767, 0, — 10207 + 9240, + 1232720, — 10440,*] 
; — — 1607, 0, — 950? + 10500, 0? + 13607,20, — 1395.) 
— 289, 0. — 560, 0, — 3507 + 4207, 0,2 + 560920, — 6307,4] 
where .M., .M,, etc. are the moments of the distribution of o? about its mean ,.M a 


D2, Ds, etc. are the uncorrected moments of the sampled population about its mean, 
and N is the number in the sample. 


Tchouproff’s formulae may be expressed in terms of the betas of the sampled 
population as follows : 


Mean = (1 De, 


M, = 68, +2 — (38,— 212, 188, + 26) 


(38, - 338, — 298, + 54) — ys 158,108, +30}, 


M,= 13 + 248, 158 + 488, + 968, 30) 


7 wi (48, — 408, — 968, — 548,’ + 3368, + 5288, — 306) 
+ vi (68, — 968, — 1768, — 1028,? + 9248, + 12328, — 1044) 
_ ¥ (48, — 888, — 1608, — 958.2 + 10502, + 13608, — 1395) 


— 288, — 568, — 358,? + 4208, + 5608, — 630)} 


{* The formulae (1) and (2) were known before the publication of Prof. Tchouprofi’s paper. Ep.] 
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These formulae, though theoretically exact, suffer from the very considerable , 
disadvantage that they involve the first eight moments of the sampled population. 
In the first place the large probable errors of the higher moments of the sampled 
population (this is in general merely a larger sample from a still larger population) 
cause formulae involving their use to be of doubtful value in practical statistics, 
and secondly the labour involved in calculating the higher of these eight moments 
is too great to be undertaken in ordinary statistical practice. 


(6) The consideration of the second of these points will be left to the next 
section, the purpose of the present section being to determine whether or no these 
formulae do give good representations of actual experimental distributions of o* of 
samples in spite of the fact that they, the formulae, contain high moments whose 
exact size is questionable. To this end Tchouproff’s formulae were applied directly 
to the betas of Populations (A) and (B) given in the Introduction. 

The results obtained were : 


(1) The Distribution of o* of Samples of 10, obtained by the application of 
Tchouproff’s Formulae to the actual uncorrected Moments of Population (A), had 
for its constants : 

Mean = 13°282,791, M,= 42°047,408, 
M, = 360°510,852, = 11,747°805,045, 
and Standard Deviation 
= 6°484,397, 
which gave betas of 
2B, = 1°748,312, = 6°644,744. 

(2) The Distribution of o* of Samples of 10, obtained by the application of 
Tchouproff's Formulae to the actual uncorrected Moments of Population (B), had 
for its constants: 

Mean = 11°465,313, 31:284,573, 
2M, =179'342,927, 2M, = 4,445°996,023, 
and Standard Deviation 
= 5°593,261, 
which gave betas of 
»B, =1:050,297, = 4°541,956. 

Tables of comparison of these constants with the corresponding constants of 
the actual Distributions I(b), III (6) and IV (6) of o* of 1000 samples of 10 from 
Populations (A) and (B) are 


Table for the Distributions from Population (A). 


Constant | Theoretic Value | Value from I (5) | Value from III (5) 
Mean 13°282,791 | 15°848 +0°138 13°725 + 0°138 
| 42°047,408 | 61°162,896 + 2°130,785 45 829,375 + 2°130,785 
= 360°510,852 | 701°313,792 + 74°847,038 505°969,031 + 74°847,038 
2M, 11,747°805,045 | 24,542°441,607 + 5,299°341,004 | 17,895-282,136 + 5,299°341,004 
2B, 17483 | 2°1496 + 0°5379 2°6596 + 0°5379 
2B, 66447 6°5606 + 2°6844 8°5202 + 2°6844 
22 6°4844 | 7°8207 +0°0978 | 6°7697 +0°0978 
| 
Biometrika xvi 23 
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Table for the Distributions from Population (B). 


Constant | Theoretic Value Value from IV (5) 
Mean | 11°465,313 11609 +0°119 
oM, | 31:284,573 32°204,119 + 1°255,819 
| 179°342,927 194°848,209 + 22°569,784 
4,445°996,023 | 4,857°431,837 + 702-936,762 
2B, 1:0503 1°1367 +0°1788 
2B, 45420 4°6836 + 0°5087 
5°5933 5°6749 +0°0843 
| 


In these fables the P.E.’s of the constants of the experimental distributions 
were obtained (in accordance with theory) from the constants of the corresponding 
theoretic distributions. The higher betas ,B;, .B, and .B, of the two theoretic 
distributions necessary to calculate the p.£.’s of ,.M, and .M, of the experimental 
distributions were calculated from the ,B, and .B, of the theoretic distributions by 
use of the difference formulae of the next section, this being the only feasible 
method. This fact, together with the general theory upon which the P.E. rests, 
causes the values of the P.E.’s given here to be of doubtful accuracy and utility. 
The indication they give of the size of the variations of the constants due to 
sampling is probably only of a quite approximate nature. 


The tables show very little beyond the large variations to be expected in the 
betas and higher moments of actual distributions of o* of samples. The subsequent 
tests for Goodness of Fit show that these variations have little significance ; for 
example, although the agreement between the constants of Distribution III (6) and 
the theoretic constants in the first table is strikingly less close than that between 
the constants of Distribution IV (b) and the theoretic constants in the second 
table, the fit in the first case is rather better than that in the second. 

With the application of the Goodness of Fit tests in view these two theoretic 
distributions of o? of samples of 10 from Populations (A) and (B) were now fitted 
with Pearson Frequency Curves. 


The Constants of the first being 
Mean = 13°282,791, 2B, = 1°748,312, 
2B, = 6644,744, >, = 6°484,397, 


the Skewness is ‘932 and it is represented by a Type IV curve with equation of 
the form 


y= 


where, in the usual notation, 
r = 11°434,579, m = 6°717,290, 
z= 1,926°017,102, v = — 42°370,612, a = 5°457,542. 


i 
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N rer iar 
where tan d = “= — 3°705,4807, 
we have log yy = — 14°416,779, 
and the curve is therefore 
zx 
542) 
The Origin = mean +“ =~ 6:940,026. 
1 oM, r—2 
The Mode = mean — 5 10°272,229, 


which corresponds to @ = 72° 25°6’. 
Putting «=a tan @ the curve becomes 
log y = — 14°416,779 + 13:-434,579 log cos @ + 18°401,324 @. 
Again, Distributions I(b) and III(b) range from v,=0°5 to v,=55°5, so it is 
sufficient to plot from @= 53° to 84°. 
The Plotted Ordinates of this Curve, which is shown in Fig. ITI, are 


6° x | y x y x y 
53 7°2424 0°4386 68 | 13°5079 49°3761 19°6798 64°6459 
| 5d 7°7942 1°0093 | 13°8548 53°2682 75 20°3678 60°8621 
8°4039 2°2095 69 14°2174 57°0318 75°5| 2171027 56°4311 
59 9°0620 4°5768 14-5969 60°5757 76 21°8890 51°4749 
61 9°8457 89122 70 | 14°9945 63°8058 22°7323 46-1399 
62 10°2642 12°1219 705 | 15°4116 66°6226 77 23°6392 40°5882 
63 10°7110 16°1846 71 | 15°8498 68°9275 ei 25°6757 29°5106 


64 11°1896 | 21-1833 71 16°3109 70°6270 79 28-0766 19°5119 
65 11-7037 27°1414 72 | 16°7966 71°6367 80 30°9513 11°5241 
66 12°2578 33°9870 725 | 17°3091 71°8859 81 34°4576 5°9385 
66°5| 12°5515 37°6856 17°8508 71°3242 2 38°8324 2°5862 
67 12°8572 41°5184 73°35 | 18°4243 69°9243 83 44°4481 0°9103 
67°5| 13°1757 45°4356 19°0327 67°6879 8h 62°3800 0°0442 


The Constants of the second of these theoretic distributions of .c? of samples 

of 10 being 

Mean = 11°465,313, oB, = 1:050,297, 

2B, == 4541,956, >. = 5°593,261, 
its Skewness is ‘522 and it is represented by a Type III'curve with equation of the 
form 

a\P 
y=ye™ (1 + *) , where p = ya. 


In the usual notation we have for this curve 
= 0°348,880, p = 2'808,447, 
a = 8'049,894, 
23—2 
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N peo 
The equation of the curve is therefore 


Also Yo= Le. Yy) = 80°6342. 


y = 80°6342 "348.8802 (1 


8:049,894 

The Mean of the distribution is 11°465,313. 

The Origin (which is the Mode) = 8°598,998. 

The actual Distribution IV (6) ranges from v,=0°5 to v, = 40°5,so it is sufficient 
to plot from #=— 8-0 to a= 32:0. 


FIG: I. 
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o?’s of Samples in ;4, sq. inch of Barometric Height 


Distribution of o? in 1000 Samples of 10 from the Skew Distribution of Barometric Heights (A). 
Fitted Curve from actual moments of (A). 


The Plotted Ordinates of this Curve, which is shown in Fig. IV, are 


© y y x | y y 
00008 -0°5 80-1800 8 34°3576 2 09022 
| —7 3-0386 0-0 80°6342 10 23-7808 26 0°5322 
| —6 14:0355 OS 80°2157 12 15°8987 28 0°3109 
30°2227 1°0 79-0353 14 | 10°3348 30 071801 

47°2824 2 74:8409 16 6°5639 32 071035 

61°9925 8 689160 18 | 4:0885 

-2 72°6420 4 62:0113 20 | 2°3047 

78°7497 6 47°5042 22 | 1°5126 
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Tests for Goodness of Fit. 


(1) For theoretical curve of o* of samples of 10 from Population (A) against 
experimental Distribution I (0), i.e. ticket drawing : 


29 Groups. x? = 169°95. P=0. 


(2) For theoretical curve of o? of samples of 10 from Population (A) against 
experimental Distribution ILI (0): 


29 Groups. %=2397. P=-6831. 
| \ 
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o2’s of Samples in squared units of original Distribution (B) with sub-range one unit. 


Distribution of o? in 1000 Samples of 10 from the Smooth Skew Distribution (B). 
Fitted Curve from actual moments of (B). 


(3) For theoretical curve of o* of samples of 10 from Population (B) against 
experimental Distribution IV 


26 Groups. x? = 22:46. P=-6090. 


From these it may be concluded that the theoretical Pearson Curve of o° 
of samples of V from any “infinite” population, as obtained from its moments 
given by the Tchouproff formulae (1), (2), (3) and (4), does give an adequate 
representation of actual distributions for cases where N is small, provided the 
sampling leading to the actual distribution is of a reasonably random character. 
Moreover there is nothing to suggest that this representation will be any less 
accurate for larger values of NV, always providing the assumption of an “ infinite ” 
sampled population be legitimate. 
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Population (A) is a sampled population with frequency groups of irregular size 
and with a very pronounced “ tail,” whereas Population (B) is one with graduated 
frequency groups and no pronounced “tail.” From the fits obtained here it is 
evident that Tchouproff’s formulae give as good results in one case as the other, 
that is: the presence of a “tail” has little significance for the purpose of sampling 
provided the sampling is random. It is very interesting, however, to see the 
divergences its presence causes between the actual and theoretical values of the 
higher moments of the distribution of the o* of samples and still further the 
negligible ultimate effect that arises from these divergences as measured by x”. 


(c) From the previous section Tchouproff’s formulae may be deemed to give 
an efficient representation in practice of actual distributions of o? of samples from 
any “infinite” population, and therefore it is of importance to ascertain whether 
their practical use can be extended by eliminating the laborious process of 
determining the first eight moments of the sampled population. 


Now the sampled population being in general merely a sample from a larger 
population, the actual values of its higher moments are problematical as they are 
subject to large Probable Errors. Since Tchouproft’s formulae can be expressed 
in terms of the betas and second moment of the sampled population, it is possible 
therefore that quite good approximate results for the moments of the distribution 
of o* of samples may be obtained if, instead of the actual higher betas of the 
sampled population, any convenient good approximations to them are used. The 
convenience here will consist mainly in the ease and speed of the calculation 
necessary to obtain these approximate values. 

Now ussuming the applicability of Pearson’s Frequency Curves, the values of 


B,;, B, and B, can be obtained theoretically from the actual values of B, and Bz 
by the use of the difference formulae 


Bn (even) =(n+1){$8n1 (1 +42) [(1 — Ju — 1a), 

Bn(odd) =(n+1) {48,Bn.+(1 + $a) By} /(1 — 4n— 1a), 
where a= (28, —38, — 6)/(B. + 3), 
and the purpose of this section is to determine whether the representation of 
actual distributions of o° of samples is as good, when in Tchouproff’s formulae the 
actual 8;, 8, and 8, of the sampled population are replaced by their approximate 


values obtained by the use of the above difference formulae, as it was when these 
higher betas were given their actual values as in the previous section. 


Before the actual work performed with this object is given, it may be noted 
that the Tables XLII (a), (b), (c) and (d) of the higher betas calculated from 
8, and 8, by these difference formulae, given in Tables for Statisticians, pp. 73—79, 
are in one or two cases in error. They have been re-calculated lately by Professor 
K. Yasukawa and are re-issued in the present number of Biometrika. Also, what 
is of more importance here, these difference formulae are based on the assumption 
that the sampled population is represented by a smooth curve, that is it is a 
continuous distribution. Therefore to apply them properly the 7,, %,, ¥, of the 
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sampled population should be first corrected by Sheppard’s corrections and the 
B, and 8, calculated from these values, then the difference formulae (or tables) 
should be applied to obtain the higher betas of the sampled population and the 
Tchouproff formulae applied to them to obtain the first four moments of the dis- 
tribution of o? of samples of V. Sheppard’s corrections reversed should then be 
applied to these four moments and from the resulting uncorrected moments the 
two working values of the betas of this distribution obtained. 


This procedure is such that, if carried out, the time and labour involved would ~ 
be little less, if any, than that involved in the calculation of the first eight moments 
of the sampled population. Consequently, unless the straightforward application 
of these difference formulae as they stand gives a reliable representation of actual 
distributions of o* of samples there is, for practical statistics, little point in their 
use. 

With this consideration in view, in the following work, the difference formulae 


have been applied quite straightforwardly to the first two betas of the sampled 


populations to obtain the values of the higher betas to be substituted in the 
Tchouproff formulae. 


The Constants of ‘the original Populations (A) and (B), when the higher betas 
were calculated by the difference formulae, were 
(1) For Population (A): 
= 14758657,  B,= 0°189,424, 
B.= 3:160,978, 8,= 1:865,497, 
B,=19°012,510, 8, = 182°728,377. 
(2) For Population (B): 
= 12°739,237, 0°219,333, 
B.= 3°157,676, 2°120,111, 
B,=19°058,578, 8, = 182°723,685. 
On application of Tchouproff’s formulae it was found that the Constants of 
the Distribution of o? of Samples of 10 from Population (A) were 
Mean = 13°282,791, 42°047,408, 
M, = 292°350,383, 2M, = 8,615°709,286, 
which gave >. = 6°484,397, 2B, = 1:149,714, 
and = 4°873,181; 


whilst the Constants of the Distribution of o* of Samples of 10 from Population 
(B) were 


B, = 
B, 


Mean = 11°465,313, .M,=  31:284,573, 
.M, = 186066245,  .M, = 4,709°529,436, 
which gave >= 5°593,261, eB; = 1°130,522, 


and »B, = 4°811,178. 
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The fitting of Pearson Curves to these two distributions gave the following 
results : 


For the first, « = 3°736 ; consequently a Type VI curve gives the best fit. 
The equation is of the form y = y (a — a)®/a%, and in this case 
r=—54978,810, e¢=—276:156,622; 


the quadratic is 2° + 54°978,8102 — 276°156,622 =0; 
its roots are — 59°611,422 and 4°632,612. 
Therefore 60°611,422, = 3°632,612 (.M, is positive) ; 
ae 
also M, = P(r+l1) 
Accordingly = 157°615,481 ; 
also log = 133'033,978. 


Thus we have the curve: 
y = 10598 — ae, 
The Origin = — 157°613,650. 
The Mode = 10°050,406. 


The actual Distributions I(b) and IIT (6) range from v, = 0°5 to v, = 55'5, so it is 
sufficient to plot the curve from « =158°0 to « =213°0. 


The Plotted Ordinates of this Curve, which is shown in Fig. V, are 


| x | y oA y x y x | y 
| 168 | 00183 167 69°8821 175 | 38-4848 188 | 3°8014 
| 159 | 13106 | i6766| 70-4309 | 176 | 333829 | 190 | 95932 
160 | 6°4578 168 7*2998 287042 193 13469 
161 | 15-7972 169 68:5138 178 | 24-4908 196 | 0°7107 
162 | 27-7962 170 65°0586 179 | 20°7534 199 | 03720 
163 40°3752 171 60°4533 180 17-4800 202 | 01937 
| 164 | 51-7445 172 55°1579 182 | 12-2091 205 0°1005 
| 165 | 60°7301 178 49-5505 184 8°3823 208 0°0521 
| 166 | 66-7905 174 43-9209 186 5-6762 213 0-0174 
| 


For the second, « = 4°713, so these data are likewise best fitted by a Type VI 
curve. 


Here r = — 69°690,784, € = — 326°992,335 ; 
the quadratic is 2* + 69°690,7842 — 326°992,335 =0; 
its roots are — 74103,432 and 4°412,648. 

Therefore qi = 75°103,432 and q.=3'412,648 (,M, is positive) ; 
also a = 178°657,362 and log y, = 169°391,180, 


and the curve y = 10118 (g — 178°657,362 #1268 
The Origin = — 178°504,191. 
The Mode = 8°657,465. 
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The actual Distribution IV (b) ranges from v, = 0°5 to v,=40°5, so it is sufficient 
to plot the curve from «=179-0 to « =219°0. 


MEAN. 


MODE. 


50 


FIC:XZ. 


HISTOGRAM OF 
DISTRIBUTION II(b) . 


\ (C= FREQUENCY 2. 


| 


15 B-283 295 565 
o?’s of Samples in 45 sq. inch of Barometric Height. 


Distribution of ¢? in 1000 Samples of 10 from the Skew Distribution of Barometric Heights (A). 
Fitted Curve from B, and B, of Distribution (A), higher B’s by the Difference Formulae. 


The Plotted Ordinates of this Curve, which is shown in Fig. VI, are 


x y | y | y x | y 
| 
179 0°0404 187 81°6889 194 | 41°3504 206 3°2752 
180 2°8129 18716 | 81-7382 195 34°8630 208 | 20171 
181 12°4006 188 | 80°5368 196 | 29-0766 S10 | 11-2312 
182 27°5800 189 | 76:5008 197 24-0240 212 | 0°7462 
183 44°6443 190 =| 70°5232 198 | 19°6863 214 0°4497 
184 60°1402 191 | 63°4358 200 12°9475 216 | 0°2698 
185 71-8902 192 55°9085 202 | 83252 219 0°1247 
186 79°0347 193 | 48-4341 204 | 5°2587 
| 


The tests for Goodness of Fit gave the following results: 

(1) For theoretical curve of o* of samples of 10 from Population (A), when the 
higher betas of this population have been replaced by the values obtained by 
the difference formulae, against experimental Distribution I (6), i.e. ticket drawing: 

29 Groups. = 166-72. P=0. 
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(2) For theoretical curve of o* of samples of 10 from Population (A), when the 
higher betas of this population have been replaced by the values obtained by 
the difference formulae, against experimental Distribution ITI (0): 

29 Groups. x? = 25°78. P =°5851. 

(3) For theoretical curve of o? of samples of 10 from Population (B), when the 
higher betas of this population have been replaced by the values obtained by 
the difference formulae, against experimental Distribution IV (0): 

26 Groups. x? = 23°99. P ='5200. 


|| FIG: W. 
| (J="REQUENCY 2. 
| 
| \ 
| 
8-657 41-5 


115465 
o?’s of Samples in squared units of original Distribution (B) with sub-range one unit. 


Distribution of oc? in 1000 Samples of 10 from the Smooth Skew Distribution (B). 
Fitted Curve from B, and 2 of Distribution (B), higher B’s by the Difference Formulae. 


The fits here are, for all intents and purposes, identical with those obtained 
in the previous section, consequently the chief difficulty in the application of 
Tchouproff’s formulae is abolished. 

We thus conclude that the moments of the distribution of o* of samples of V 
from any “infinite” population may be deduced with reasonable speed and ease 
by the application of Tchouproff’s formulae for the actual second moment, and 


| 
tre 


A. E. R. Caurcu 355 


by using the first two betas of the sampled population and the approximate values 
of its higher betas as obtained by the assumption of Pearson’s Frequency Types, 
together with the straightforward application of the difference relations which 
follows that assumption. Moreover the values of the moments of the distribution 
of o* of samples so obtained will lead to a representation of this distribution by a 
Pearson Frequency Curve, which will describe adequately any actual distributions 


of o* of samples obtained from an actual set of samples that are the result of 
truly random sampling. 


Part IV. THE DistrRiBUTION OF MEANS OF SAMPLES OF NV FROM A FINITE 
POPULATION OF M. 


If, in drawing a sample of NV, the individuals are drawn separately and replaced 
each time before the next is drawn, the sampled population, as we have seen before, 
can be treated as infinite in all its categories. If however the whole sample be 
drawn before replacement, the sampled population, in theory, must be treated as 
of finite total frequency M. However, as we shall see later, even when the samples 
are actually so drawn in practice, the resulting distributions can be represented 
approximately under certain circumstances by theoretical curves obtained by 
treating the sampled population as infinite, the loss in accuracy being small whilst 
the saving of time and labour is considerable. The determination of reliable 
conditions under which this can be done is a matter of some difficulty and will 
be discussed more fully later; but, in order that this consideration should not 
affect the discussion, in Parts IV and V a sampled population has been chosen 


which clearly cannot be treated as infinite when the complete sample is drawn 
before replacement. 


This population, which will be called Population (C), is the binomial distri- 
bution (4 + 8), having a total frequency of 36. 


The distribution, its moments and betas are as follows: 


| Designation of the Frequency Group A | B Cc 


| Central Abscissa 
| 


Frequency... | 1 | 10 25 


Mean= __1°666,667, ve= 0°277,778, 
= — 0°185,185, 0:277,778, 
v; = — 0°390,950, v,= 0°620,722, 
1°664,817, o= 0'527,046, 
B,=  1°600,000, B.= 3°600,000, 
B,= 12°160,074, B, = 28°959,690 
and By = 279°624,976. 
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Now the various types of samples of 10 from this population of 36, with their 
frequencies and means, are : 


Type of Sample | Frequency Mean | 
| 
0A OB 10C | 3,268,760 2°0 
0A 1B 90C 20,429,750 19 
04 2B 48,670,875 1:8 
0A 3B 7C 57,684,000 17 
04 4B 60C 37,191,000 16 
04 5B 5C | 13,388,760 1°5 
0A 6B 4C | 2,656,500 
o4 373 8C 276,000 1°3 
ae: | 13,500 1-2 
0A 9B 250 
0A 10B 1 1-0 
14 OB 9¢ 2,042,975 1°8 
14 1B 10,815,750 
14 2B 7¢ 21,631,500 1°6 
14 3B 6C 21,252,000 1°5 
14 4B 11,157,300 1:4 
14 5B 4¢ 3,187,800 
14 6B 3¢ 483,000 1:2 
14 7B 2¢ 36,000 11 
14 8B 1,125 10 
14 9B OC 10 0-9 


The total frequency of the samples is, of course, 36!/(10! 26!) = 254,186,856. 
Taking the means of samples, we have the following discrete frequency distri- 
bution, which is the true or complete distribution of means of samples in this case. 
The Complete Distribution of Means of 
Samples of 10 from (C). 
Value of F Pe 
the Mean 


Aas 
— 


1,126 
36,250 
496,500 
3,463,800 
13,813,800 
34,640,760 
58,822,560 
68,499,750 
50,713,850 
20,429,750 
3,268,760 


Cs 


on 


Total 254,186,856 


The constants are: 
Mean = 1°666,667, ,M, = 0:020,635, 
= —0°000,647, = 0°001,199, 
0143,649, ,B, = 0°047,697, 
,B, = 2°816,989. 
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Now the formulae for the Mean, Second Moment-Coefiicient and first two Betas 
of the Distribution of Means of Samples from a finite population are as given 


below, but their ultimate origin is doubtful, for I have been unable to ascertain 
the person who first deduced them. 


The fermulae are: 


Mean of Distribution of Means of Samples = Mean of Sampled Population...(1), 


_ (M-1)(M-2NY 


p, = 1) + M+ 8,4 3M 
N (M — 2)(M—3)(M-D) 
where 9, 8, and #, are the corresponding constants of the sampled population. 
On application of these formulae to the constants of Population (C) the results 
were: 
Mean of Distribution of Means = 1°666,667, 
iM, =0°020,635, 
1B, =0°047,697, 
= 2°816,989, 


which agree with the values given above, thus supplying a numerical check to the 
formulae. 


In the consideration of the Distribution of Means from any population when 
the complete sample is drawn before replacement, as the above formulae give the 
betas of the distribution, three considerations arise: 


(a) The adequate representation of this complete distribution by a Pearson 
Curve, the constants of which are those given by formulae (1), (2), (3) and (4) 
of this part. 

(b) The possible adequate representation of this distribution by a Pearson 
Curve, the constants of which are the values of ,M,, ,B, and ,B, given by the 


simpler formulae of Part I, which involve the assumption of an infinite sampled 
population. 


(c) A Distribution of Means will not be obtained actually in practice by 
theoretical methods as employed here to obtain the complete distribution from 
Population (C), but by the actual drawing of a limited number of samples. The 
question thus arises as to whether the Pearson Curve, as obtained in the first 


of these considerations, will represent adequately such a distribution of Means 
obtained by actual sampling. 


(a) Now it does not follow a priori that, because the constants obtained by 
the formulae of this part are the actual constants of the complete distribution 
of Means of Samples obtained from the population, the resulting Pearson Curve 
will represent the distribution, for it is possible for different distributions to have 
the same second moment and first two betas. 
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As we shall see in the case of the Distribution of o* it is possible, in extreme 
cases, for the representation by a Pearson Curve to fail completely. Now, un- 
doubtedly, Population (C) is extreme both in the paucity of its categories and in 
its smallness, whilst its skewness and lack of a true mode are abnormal. Therefore, 
if the true distribution of Means of Samples be adequately represented by a Pearson 
Curve in this case, we are practically justified in assuming that these formulae and 
a consequent Pearson Curve will represent, for practical purposes, any Distribution 
of Means of Samples from a finite population. 


A Pearson Curve is thus required whose constants are 
Mean = 1-666,667, >, = 0°143,649, 
=0°047,697, = 2'816,989, 
the unit being that of Population (C). 
The curve is clearly of Type I and therefore, in the usual notation, we have 
r = 20°850,646, e = 101°459,700, 
— 20°850,646m’ + 101°459,700 = 0, 
with roots 13°113,664 and 7°'736,982; as ,M, is negative, therefore 
m, = 13°'113,664 and m,’ =7°736,982 ; 
therefore m, = 12°113,664 and m, = 6°736,982. 
Changing, for convenience, the unit to ;4, of that of (C), we have 
b =13°'899,77, a,=8-932,13 and a,= 4:967,58. 

Now as the curve is to be used again later and, moreover, the total frequency 
254,186,856 of the complete Distribution of Means of Samples of 10 from (C) is 
so large, the total frequency of the curve has been taken as 1000. 

Accordingly : Yo = 269°791. 

The curve is thus 


2°113,664 
y = 269791 (1+ £96758) 


The origin, which is the mode, is at 16°666,67 + 0°190,15 = 16°856,82, whilst the 
range is roughly from # = —9°0 to # =5°0. 


The curve is shown in Fig. VII, and its ordinates are 


Abscissa Ordinate Abscissa Ordinate | 
0°001 —O0°5 256°229 
—6:0 0:077 0:0 269°791 
-5°0 1°420 0°5 255°379 
—4°0 10°837 1:0 214°612 
-3°0 45°603 2°0 96°793 
—2°0 122-873 3°0 17-570 
-1:°0 220°297 4°0 0°256 
? 
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For thé representation of the complete Distribution of Means of Samples of 10 
from (C) by this curve, we have 


Value of Mean hag in Actual Frequency 
1°3 and under 17°40 15°73 
55:56 54°34 
1°5 135°48 136°28 
16 227°76 231°42 
261°48 269°49 
1°8 206°40 199°52 
1°9 80°88 80°37 
2-0 15°04 12°86 
ui 
| 
oO 
| 4250) 
FIG: | 
{200 
HISTOGRAM = OF | 
DISTRIBUTION Milla. | 
FREQUENCY 8-[ | | 
| 
| 
12 1686 20 
16-67 


The small frequencies at the tails have been clubbed together so that the 
Goodness of Fit test may be applied. The actual frequencies are the true fre- 
quencies of the complete Distribution of Means of Samples of 10 from (C) 
multiplied by er to bring the total frequency down to 1000, as is 
necessary for comparison. This complete Distribution of Means is, as we have 
seen, a set of discrete frequencies, but as these discrete frequencies occur at 
equal intervals, we can represent it by a histogram by drawing rectangles, whose 
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areas are the separate discrete frequencies and whose mid-ordinates correspond 
to the values of the mean for which the discrete frequencies occur. Hence we 
can obtain a comparison between the curve and the distribution which is quite 
reasonable. 

The application of the Goodness of Fit test gave P ="993 with y?= 1:04 for 
8 Groups. Thus, even in this extreme case, a Pearson Curve represents perfectly 
adequately the true Distribution of Means of Samples. 

Still further, an inspection of the betas of this distribution shows that, even in 
this exceptional type of sampled population, the Distribution of Means approximates 
very closely to normality. This is so clear, especially in the light of the fits to the 
Normal Curve of previous distributions of means in this paper, that the deter- 
mination of the actual fit is not worth the labour. 

Thus it may be concluded that the Distribution of Means of Samples from 
practically any “finite” population can be treated for practical purposes as a 
normal distribution, or, if a better representation is desired, can be represented 
by a Pearson Curve obtained from its first four moments. 


(b) From the previous consideration, formulae (1), (2), (3) and (4) of this part, 
together with a Pearson Curve, will represent adequately an actual distribution 
of means from a “finite” population, i.e. a distribution of means of samples where 


each sample was drawn completely before replacement. Now, it has been shown 
that the formulae 


= 3 Coe (6), 
iB, = 3 


together with a Pearson Curve, will likewise represent an actual distribution 


of means from an “infinite” population, ie. a distribution of means where the 


individuals of the sample were drawn singly and replaced. 

It will be useful if a practical indication can be obtained as to when the results 
of these two sets of formulae are not significantly different, that is, an indication 
as to when the Pearson Curve obtained from either set will represent reasonably 
well in practice, not only distributions of means obtained by sampling conducted 
in accordance with the assumption made in obtaining the curve, but also dis- 
tributions obtained by sampling conducted in accordance with the opposite 
assumption. When this is so, it follows that a Pearson Curve obtained from 
the constants given by the simpler formulae (5), (6) and (7) will represent 
adequately in practice any actual distribution of means of samples obtained 
from sampling conducted by either method, that is, the assumption of an infinite 
sampled population will be legitimate, in practice, for the representation of dis- 


tributions of means arising from sets of samples obtained either by drawing the 
complete sample, or individuals singly, before replacement. 


| 
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= The following tables give approximate values of ,B, and ,B, for values of M, N, 
8, and 8, covering a fairly extensive range. For convenience, the value M of the 
total frequency of the sampled population has been expressed as a multiple of 


the magnitude NV of the sample. In each table the row opposite 7 =o gives the 


values of the constants for samples drawn from an infinite population, that is, 
the values calculated by formulae (5), (6) and (7). 


Tables giving the values of ,B, for various values of M, N and B,. 


B,=0'1. 
MIN N=2 N=5 N=10 N=50 N=100 
0°050000 0-020000 0:010000 0°002000 0:001000 
100 0°049242 0°019519 0:009730 0:001941 0:000970 
50 0:048470 0°019036 0:009461 0°001883 0:000941 
10 0°041701 0°015123 0:007330 0°001431 0:000713 
5 0:031641 0°010208 0:004785 0°000911 0:000453 
B,=0°5. 
M|N N=2 N=5 N=10 N=50 N=100 
0°250000 0°100000 0:050000 6-010000 0°005000 
100 0°246212 0°097595 0048651 0°009707 0°004852 
50 0°242348 0°095181 0°047304 0°009415 0°004705 
10 0°208505 0:075617 0°036651 0°007154 0°003566 
5 07158203 0°051040 0°023926 0°004555 0:002264 
B, =1°0. 
MIN N=2 N=5 N=10 N=50 N=100 
0) 0500000 0°200000 0°100000 0°020000 0:010000 
100 0°492425 0°195191 9°097302 0°0i19414 0°009704 
50 0°484696 0°190363 0-094608 0°018831 0°009410 
10 0°417010 0°151235 0°073303 0°014308 0°007133 
§ 0°316406 0°102079 0°047852 0:009109 0°004527 
B, = 15 
MIN N=2 N=5 N=10 N=50 N=100 
0°750000 0°300000 0°150000 0°030000 0°015000 
100 0°738637 0°292786 0°145953 0°029121 0°014556 
50 0°727044 0°285544 0°141912 0°028246 0°014115 
10 0°625514 0°226852 0°109954 0:021462 0°010699 
5 0°474609 0°153119 0°071777 0°013664 0:006791 
Biometrika 
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B, 2:0. 
M/N N=2 N=5 N=10 N=50 N=100 
1-000000 | 0-400000 | 0:200000 | 0-040000 | 0-020000 
100 | 0°984849 | 0:390381 | 0°194604 | 0:038827 | 0019408 
50 | 0969392 | 0380726 | 0:189216 | 0-037662 | 0:018819 
10 | 0°834019 |} 0°302469 | 0146606 | 0-028616 | 0-014265 
5 | 0632812 | 0-204159 | 0-095703 | 0-018218 | 0:009054 


Tables giving the values of ,B. for various values of M, N and B,. 


M/|N N=2 N=5 N=10 N=50 N=100 
2°000000 2°600000 2°800000 2°960000 2980000 
100 2°010101 2°606406 2°803593 2°960780 2°980394 
50 2°020408 2°612823 2°807170 2°961554 2°980784 
10 2°111111 2°664303 2°835052 2°967449 2°983751 
5 2°250001 2°727273 2°867021 2°973887 2°986972 
B,=2 
M/|N | N=2 N=5 N=10 N=50 N=100 
2°500000 2°800000 2*900000 2980000 2°990000 
100 2°497398 2°798365 2°899085 2°979802 2°985900 
50 2°494586 2°796657 2°898142 2°979599 2°989798 
10 2°462963 2°780142 2°889438 2°977799 2°988894 
5 2°390626 2°750988 2°875166 2°975004 2°987501 
B, =3 
M|N N=2 N=5 N=10 N=50 N=100 
3°000000 3°900000 3°000000 3°000000 3°000000 
100 2°984695 2°990323 2°994578 2°998823 2°999406 
50 2°968763 2°980490 2°989113 2°997644 2°998811 
10 2°814815 2°895981 2°943825 2°988148 2°994037 
5 2°531264 2°774704 2°883311 2°976122 2°988030 
Be =4 
M|N N=2 N=5 N=10 N=50 N=100 
3°500000 3*200000 3°100000 3°020000 3°010000 
100 3°471993 3°182281 3°090071 3°017844 3°008912 
50 3°442940 3°164323 3°080085 3°0i15689 3°007824 
10 3°166667 3°011820 2°998212 2°998497 2°999180 
5 2°671876 2°798419 2°891456 2°977240 2°988560 


| | 
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M/N N=2 N=5 N=10 N=50 N=100 


2 4°500000 3°600000 3°300000 3°060000 3°030000 
100 4°444659 3°566198 3°281057 3°055887 3°027923 
50 4°391295 3°531990 3°260293 3-051780 3°025851 
10 3°870370 3°243499 3°106985 3°019196 3°009465 
5 2°953127 2°845850 2-907746 2°979476 2°989618 


To determine approximately whether a value in any column of one of these 
tables is significantly different or not from the value at the head of the column, 
we are practically compelled to use the ordinary criterion of the Probable Error. 
The difference between these values must be compared with the P.E. of the value 
considered and the difference counted significant or not according as it is of the 
same order as, or considerably less than, three times this P.E. 

The value of the p.£. will depend, of course, upon the total frequency desired 
for the distribution of means to be represented, but, apart from this, the value 
of the P.E. is still very doubtful as it has been obtained on the assumption that 
the higher betas of this distribution of means are given by the well-known difference 
formulae referred to in previous sections of this paper. This is, in general, only 
very approximately true, consequently the value of the P.E. can only be taken as 
a rough guide. However, this very roughness makes it possible to interpolate 
approximately in and between the tables, concise as they are, and so to obtain 
rough values of ,B, and ,B, for values of M, N, B, and P, not explicitly occurring. 
These, used in conjunction with this somewhat doubtful value of the P.z., will 
give at any rate an indication as to whether the difference in the two values of 
1B, or ,B,, as the case may be, is significant or not. 

It is thus probable that, by interpolation in and between these tables together 
with the evaluation of the p.E.* of the value of the constant so approximately 
obtained, a rough but fairly reliable indication can be obtained quite quickly as 
to whether in any given case the population sampled may be treated as infinite 
or not, that is, as to whether the method (complete samples or individuals drawn 
before replacement) of sampling is immaterial or not. 


In order to obtain some confirmation of this, two particular cases were in- 
vestigated : 


(1) Distributions of Means of Samples from Population (A). 
(2) The Distribution of Means of Samples from Population (C). 


(1) Distributions of Means of 1000 Samples of 10 from Population (A). 


Now we have the following four series of Distributions of Means for 1000 
Samples of 10 from this population: 


I(a) The Means of 1000 Samples of 10, where the actual sampling was con- 
ducted by drawing marked “ tickets” singly and replacing each time. 


* See Tables for Statisticians and Biometricians, pp. 68—71. 
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II(a) The Means of 1000 Samples of 10, where the actual sampling was con- 
ducted by drawing marked “ tickets,” the complete sample being drawn each time 
before replacement. 


III (a) The Means of 1000 Samples of 10, where the sampling was conducted 
by Tippett’s Random Sample. 


V(a) The Means of 1000 Samples of 10, where the sampling was conducted 
by drawing small commercial coloured beads, the complete sample being drawn 
each time before replacement. 

Now for Population (A) B, = 0'189,424, 8, = 3°160,978, and the total frequency 
is 4011. Thus it is easily seen by a rough application of the tables and the use 
of the P.E. obtained on the basis of 1000 samples, that the differences between 
the values of ,B, and of ,B, are in each case non-significant. 

Confirming this by actual evaluation from the formulae, we have 


(a) By formulae (2), (3) and (4) of this Part (Finite Sampled Population), 
,B, = 0°018,815 + 007,018, ,B,=3°014,569 + 111,921. 

(b) By formulae (5), (6) and (7) of this Part (Infinite Sampled Population), 
= 0°018,942 + 007,146, = 3:016,098 + °111,224. 

Clearly the differences are in each case non-significant. We should thus expect 
that the Pearson Curve with betas 0°018,942 and 3:016,098 would represent not 
only Distributions I(@) and III (a), but also Distributions II (a) and V (a) quite 
adequately. This curve has been given in Part II, Section (c), its equation being 


Now applying the Goodness of Fit test to the representation of these distri- 
butions by this curve, we have 

I(a) 15Groups, x*=8407, P=-0000. 
II(a) 19 Groups, y*?= 2428, P= 1466. 
III (a) 16 Groups, 701, P="9573. 
V(a) 16Groups, y?=13°'74, P='5455. 

The fit to II (a) is not very good, but it is much better than that to I (a), 
where the sampling was conducted in accordance with theory, and thus the poor- 
ness of fit can be ascribed principally to the difficulty of shuffling properly a large 
inass of tickets. The fit to V (a) is quite good, consequently two conclusions are 
suggested : 

Firstly that, when the tables of this section indicate the differences between 
the values of ,B, and ,B, are clearly non-significant, the population can be treated 
theoretically as infinite and the method of actual sampling (complete samples or 
individuals drawn between replacements) is immaterial. 

Secondly, when Tippett’s Random Sample is inapplicable or not obtainable, 
the inaccuracies involved in the use of “tickets” for actual sampling arising from 
the great difficulty of reliable shuffling can be very considerably reduced, for large 


populations, by using some more symmetrical and homogeneous material like small 
coloured commercial beads. 
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(2) The Distribution of Means of Samples of 10 from Population (C). 

Now the betas of (C) are 8, =1°6 and 8, = 3°6, whilst its total frequency is 36 ; 
thus it is easily seen, by rough application of the tables and calculation of the 
corresponding P.E.’s on the basis of say 1000 samples, that the differences between 
the two values of ,B, and the two values of ,B, are distinctly significant. Therefore 
the complete Distribution of Means of Samples of 10 from this population obtained 
at the commencement of Part IV cannot be represented by a curve or distribution 
obtained on the assumption that the population is to be considered infinite. 

For this population, as it is a binomial, not only have we the complete discrete 
Distribution of Means mentioned above, but we have also the discrete Distribution 
of Means obtained by considering the population infinite. That is, in this case, 


although we have seen the representation by a Pearson Curve would be adequate, 
it is not necessary. 


Assuming (C) to be infinite, ie. in any sampling individuals only are drawn 
before replacement, the Distribution of Means of 1000 Samples of 10 is the 
binomial * 

1000 ({ + 8)”, the unit being 34, of that of (C). 
Expanding this, and changing the unit back to that of (C), this distribution is 


| | 


| Value of Mean 0-9 1°0 1°2 13 | 


Frequency | 0002 | ‘0017 | -0135 | -0897 | -4935 | 2-2431 | 8-4116 | 25-8820. 


Valueof Mean| 1*4 15 1°6 17 1°8 19 20 


Frequency 64°7050 | 129°4100) 202-2030 | 237-8860] 198-2380 | 104°3360| 26-0840 


The two Distributions of Means are thus 


Frequency on the | Frequency on the 
ba of the (C) (C) 
— is infinite is finite 

0-6 00002 00000 
0-7 0:0017 00000 
0-8 0°0135 0-0000 
0-9 0°0897 0-0000 
1°0 04935 0°0044 
2°2431 071426 
1°2 8°4116 1°9533 
1°3 25-8820 13°6272 
14 64°7050 54°3461 
1°5 | 129°4100 136°2833 
202-2030 231°4189 
| 237°8860 269-4910 
1°8 198-2380 199-5179 
19 80°3745 
2-0 26-0840 12°8599 


* It may not be generally familiar, and it is indeed doubtful as to whether it has been published or 
not, although known in the Biometric Laboratory for some considerable time, that the Distribution of 
Means of Samples of V from an infinite population represented by the binomial (p+ q)" is the binomial 

p+q)", plotted to 1/N the unit of the original binomial. 
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In the second case the frequencies have been multiplied by 254,186,856 fo 
comparison. 

The fact that the first distribution in no way represents the second is obvious, 
but applying the Goodness of Fit test we have, for 8 Groups, 


x? = 35:07, P =-0000. 


We thus see that the tables of this section, together with the use of the 
Probable Errors of the constants, are quite likely to give a rough but useful 
indication as to when a given population can be treated as infinite and the 
sampling conducted by either method, that is, as far as the consideration of 
the Distribution of Means of Samples is concerned. 


(c) The representation of Distributions of Means of Samples, obtained as the 
results of actual sampling, by the Pearson Curve, whose constants are given by 
formulae (1), (2), (3) and (4) of this part. 


Now it was seen in ()) that a closer agreement between theory and practice 
was obtained when the actual sampling was carried out by small commercial 
coloured beads than when “tickets” were used. It is perfectly clear, a priori, that 
for perfect shuffling perfectly homogeneous and symmetrical material is required, 
but, when the sampled population is large, questions of cost and total weight of 
material are important. Tickets, which are cheap and light, have been frequently 
used but the results obtained are in general quite unsatisfactory, as has been 
shown in previous parts of this paper. Coloured beads, when small and having 
merely the rough homogeneity of size and shape that obtains in cheap commercial 
products, have been shown to be quite satisfactory when sampling from large 
populations, and moreover the variety of shades of colour enables them to be used 


when the sampled population, like Population (A), has a large number of 
categories. 


Coming now to actual sampling from small populations, it seemed clear that 
“tickets” would be unsuitable, so an attempt was made to use the same beads as 
had been used with success when sampling from the large Population (A). 


Population (C) has three categories A, B and C, of frequencies 1, 10 and 25, 
and these were represented by the appropriate number of red, orange and yellow 
beads taken from the 4011 used for Population (A). These 36 beads, it may be 
remarked, were by no means uniform in shape; they were none of them really 


spherical, and since their weights varied from ‘512 g. to 864 g., so there was also 
little uniformity of size and weight. 


Before giving the results of the actual sets of samples made, it may be well to 
re-state the values of the constants of the theoretic Distribution of Means as given 


by the formulae. They are 
Mean = 1°666,667, =, = 0:143,649, 
= 0:047,697, B, = 2°816,989, 
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and the Pearson Curve with these constants is 
12°113, 664 6°736,982 
whose ordinates etc. have been given before. 
Distribution VI(a). The Means of 1000 Samples of 10 from Population (C) 


obtained by drawing beads from a bowl, the complete sample being drawn before 
replacement. 


1°8 | 2°0 


| 
Value of Mean! 1-2 | 13 | 1-4 | 15 | 16 | 1-7 


163 


Frequency | 5 | 13 | 74 | 197 | 241 | 226 


72 | 


The constants are as follows, the origin being 1°6 and the working unit 10 times 
that of the distribution, i.e. 10 times that of Population (C): 
v= 0°400,000, 2:200,000, Mean =1°640,000, 
v= 2360,000, »v,=—0114,000, =0'148,324, 
vs = 2°590,000, v,= 12°728,800, ,B, =0°001,221, 
vs = 14°684,000, ,B, =2°629,917. 
Distribution VII (a). The Means of 1000 Samples of 10 from Population (C) 


drawn as in VI (a), but a different set of 36 beads being used. These beads were, 
however, merely another 36 from the 4011 used in V (a). 


| | 
| Value of Mean} 1:3 | 144 |] 15 | 16 | 1:7 | 1:8 | 1:99 | | 
Frequency 4 7 43 | 108 | 244 | 276 | 204 99 15 


The constants are (working unit as before) : 
0°810,000, 1:973,900, Mean =1°681,000, 
vs = 2°630,000, v,=—0°684,018, =0140,496, 
= 4644000, v,= 11:801,297, ,B, =0:060,836, 
v, = 17°786,000, =3°028,862. 

These are likewise discrete distributions, but as the intervals are equal, they 
were represented, as in the previous case, by histograms and tested for Goodness 
of Fit with the curve y = 269°791 (1 + eens) (1 _ are) given 

above. 


The results were 
Vi(a) x?=5218, P=-000,000 with 8 Groups. 
VII (a) x?=16°82, P=-018,786 with 8 Groups. 
The fits here are very bad. 


i 
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A glance at the values of the constants of VI (a) and VII (a) will show that 
these poor fits are largely due to displacements of the means of these distributions 
from the theoretical value 1:666,667. Now each of these two distributions was 
obtained by taking four sets of 250 samples, each 250 samples being completed 
at one sitting. When the fits were discovered to be poor owing to displacements 


of the means, the separate means of these 8 sets of 250 samples were evaluated 
with the following results : 


VI (a) 1st set of 250 Samples Mean = 1°6448. 


Mean = 1°6440. 

is Mean = 16260. 

VII (a) (Different set Ist ,, , ,, Mean = 1°6740. 
of beads) ,, ,, Mean = 1°6916. 

Mean = 1°6860. 


Now the p.E. of the Mean 1:666,667 on the basis of 1000 samples is 0°003,064, 
consequently it is clear that with the set of beads used in VI(a) the mean was 
consistently significantly below its true value, whilst with the set used in VII (a) 
the mean was consistently significantly above its true value. It is clear therefore 
that the lack of homogeneity of shape and size of the beads tends to produce a 


definite bias in the sampling, and this bias will depend on the particular beads 
chosen. 


We may thus conclude that, although the beads may be sufficiently homo- 
geneous to produce effective sampling from a large population, yet when samples 
are taken from a small population, material which is very close to the ideal of 


complete homogeneity of size and weight together with complete symmetry of 
shape is absolutely requisite. 


To this end 1000 fresh samples were made with clay marbles as the medium 
instead of beads; steel spheres would have been ideal as regards the conditions 
of uniformity of size and shape, but they would have been difficult to mark so 
that the categories of the population were readily distinguishable. These marbles 
were not quite of uniform size and weight (their weights ranged from 3°1535 g. 


to 3°8955 g.), but they were much more spherical and therefore much more uniform 
in shape than the beads. 


Distribution VIII (a). The Means of 1000 Samples of 10 from Population (C) 
drawn as in VI (a), but by the use of coloured spherical clay marbles. 


Value of Mean) 1°2 1°6 1°8 20 


13 | 14 | 15 


| Frequency 2 | 12 dd 147 | 250 | 257 | 196 


: 
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The constants are (working unit as before) : 
= 0°661,000, »,= 2:000,079, Mean = 1-666,100, 
2°437,000, »v,=—0341,961, =0141,424, 
v3 = 3°913,000, 11°274,984, =0014,615, 
= 15°805,000, 1B, =2°818,523. 


Representing this discrete distribution by a histogram (the intervals are equal) 
and testing for the Goodness of Fit of the representation by the curve 


£ 12°113,664 6°736,982 
y = 269791 (1 ) (1 
the result was 


VIII (a) x*=7°01, P=-4288 with 8 Groups. 


The fit here (shown in Fig. VII, see p. 359) is quite reasonable, but doubtless 


could be improved by performing the sampling with even more homogeneous and 
symmetrical material. 


Distribution VIII (a), being an experimental distribution of Means of Samples 
obtained by actual sampling from (C), can be fitted to the true Distribution of 


Means given at the commencement of this part, as a verification of the efficacy 
of this material. 


The fit was y*= 5°63, P ="5842 with 8 Groups, providing a satisfactory com- 
parison, but in general practice the true Distribution of Means would be unknown 
and the comparison could not be carried out, at least not until our knowledge is 


extended to the direct establishment of the Distribution of Means from any finite 
population. 


We see thus that for sampling from a small population, if there is to be any 
good agreement between theory and its representation as regards the distribution 
of means obtained from actual sets of samples, the material employed must approxi- 
mate closely to the ideal of complete homogeneity of shape, size and weight. This 
material must be chosen therefore with more care than perhaps has been deemed 


necessary in the past; for instance, I am convinced, cardboard tickets are quite 
useless. 


Part V. THE DISTRIBUTION OF o? OF SAMPLES OF NV FROM A 
POPULATION oF 


The Mean and Second Moment of this Distribution have been given by 
Dr Neyman in the paper to which previous reference has been made, but he did 
not attempt the much more laborious task of the determination of the Third and 
Fourth Moment-Coefficients, which are necessary if the distribution is to be repre- 
sented by a Pearson Curve. 
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Dr Neyman’s results are 


= (M— 1? (M—2)(M—3 


y — 
— (M°N —3M*+6M —3N—3)] ...... (2). 
To obtain the Third and Fourth Moments ,M, and ,M, the following expansions 
are required in addition to those given in Biometrika, Vol. xvi. p. 80. 


S being the summation over any finite distribution of X’s we have, for the 
evaluation of .M,: 


S(XY)S(X)= 
S(X){S(X)P= +8 (Xs + WS 
+ 38 + 6S (XY X_X,) 
+ 25 + 39 (XP X2X,) +69 (XPXX,X,), 
= S (X,°) + 28 (X;X,7), 

+ 28 (XPX,) + 39 +28 (X4X_X,) 
+ 4S + 48 (XP X2X,) + 6S 
+48 (X2X2X,X,), 

+89 +169 +249 X,X,) 
+ 18S + 248 (X2X2X,X,) 
+ 248 (X2X,X,X,X,), 
S(X,) 8S (XY) 8(XY)= S(X,°)+ + + 
+ 8 (XPXX;), 
S(XV)S(X,)= 
S(X,)S(XY)= (XX), 
S(X,°) {8 + W(X, X,) + (X,°X.7) + 
S(XS) +8 
S(X2)S(X,)= + + 8 (X EX.) 
+S(X'X2X;), 
+39(X"X,) + 38 +68 (XX, X,) 
+9 +39 (X)X2X,) + 6S (XPX_X,X,), 
+ 8 + 28 (X,'X,) 
+S 
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S(X,) 8 {S(X) P= + 2S (X,7X,) + WS + WW (XX, 
+ 28 (X°X,%) + 28 (XX2X,) + 2S 
+ 2S X;) + 2S + W (Xt X2X,X,), 
S(X,) {S(X,)}*= +48 (X,7X,) + 6S + 128 (XX, X,) 
+ 48 + 12S (X°X2X,) + 248 (XY 
+ 2S + 48 (XX2X,) + 6S (X 
+128 (XXPX,X,) + 248 (XX,X,X,X;), 
{S(X)P (X2) = (X,°X,%) + 2 + W (XPXFX,), 
= (X,8) + 28 + S (X,°.X.2) + W X_X,) 
+ 2S (X,°X,3) + 4S + 45S 
+ 28 + 48 (X 2X3 X,X,), 
S(X,%) = S + (X7X,) + WW (X,°X) + 38 (XP XY) 
+ 2S + 2S + S 
+ 28 + 48 + 2 (X3X2X2X,), 
S(X,%) (X17) 8 (X8) + 38 + 48 + 6S (X,°X_X;) 
+ 5S + 6S (X°X2X;) + 6S (XX, X,X,) 
+ 6S (X,+X,*) + 9S + 6S 
+ 6S (XY X2X,X,) + 88 (XP 
+128 (X2XIX,X,) + 6S (XPXZXZX,) 
+ 6S (XX2X,X,X;), 
S(X,°) S(X,8) + 5S + 108 + 208 (X,°X,X,) 
+ 11S + 30S (XX2X;) + GOS (XXX, X,) 
+ 108 + 258 (X4.X3X;) + 308 
+ 60S (XXPX,X,) + 1208 
+ 208 (X°X3X7)+408 (X°XIX,X,) +308 (XP XZX2X,) 
+ 60S + 1208 XX, X,). 
The Third and Fourth Moment-Coefficients of the Distribution of o* of Samples 
of N from a population of M. 
For this we have 


where S represents the summation over the WV values of the variate forming the 


! 
sample and = represents the summation for the wit ivy samples, the sampled 


population X,X,...X, being referred to its Mean as origin. 


Therefore |! )} _3! ( (X,)} 


M! _ (SPY 
Ez 
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Using the appropriate expansions from the above and from Biometrika, 
Vol. Xvil. p. 80, we have 


M! 5 
N* wit — = (N (X,°) 6 (N—1)? 28S 

—6 1) (NV 5) =S 4[3 1f+ 2] =S (X,°X,) 
—12[(N — 2)?+ 1] + 24[3 — 2) +1] (X3X,X;X,) 
+ 6[(N —1)?+6 (N —3) +4] 3S (X2X2X,?) — 12 [(N — 3) + 6] (XPXPX,X,) 
+ 72(N —5) 3S — (X, 

Now taking any product X,*.X,° ... X,\, the number of samples in which it will 


occur is clearly ¥-"Cy_, = (v= ( ; therefore 
! 


where S is now the summation over the sampled population i ae oo 
Therefore we have 


= 1) |M-18(X,) -6(N-1) |M-2.8(X/X,) 

—4(N—1) [3 (N 
8 
+ 24(N-1) (V-2) (N- 2+ 
+6 + 4] 8(X2X2X¥) 
- 3) 48 (XeX2X,X,) 
+72 


—720(N —1)(N —2) (NW —8) (NW —4) (N—5)|M-—6 


where S is the summation over the sampled population X,X,... Xy. 


Now since the origin is the mean of this sampled population we have S(X,)=0, 
and thus the following equations: 
Mi, = S (X,), 
0 =S(X")S(X), 
= S(X,*) 
0 {8 (X,)}, 
= {8 
(X,2)}, 


0 
0 {8(X,)}, 

0 

0 =S8(X,’) 

0 
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where y, is the sth uncorrected moment about its own mean of the sampled 
population. 

Expanding the right-hand sides of these equations by use of the expansions 
above and in Biometr tha, Vol. xvit. p. 80, we obtain 11 linear equations to express 
the 11 sums 


S(X,°), S(XPX,), S(XAXY), S(XEXLX,), 
and 
in terms of and 
Solving these we have, after considerable manipulation: 
S (X,°) = 
S(X)X.) =— 
S = — Mp,, 
28 (X+X,X;) = — M*7,7., 
2S = M*v,? — 
S(X°X,?X;) = — — 
6S = + 2M*02 — 6Mp,, 
6S (X°X2X,?) = + — 
48 (X°X2X,X,) = + 2M*02 — —6Mb,, 
248 (X2X,.X,X,X;) = 24Mv, — 18M*9,0, — + 
720S (X,X,X,X,X,X,) = + 40M*0,2 — 15M*93 
Substituting the values for these sums in the last expression for .M,’ we 
obtain : 


2(N 2) (W—3) +30) , 6 (W—2) (W—3) 65) 


(M — 1) (M—- 2) (M — 1) (M—2) (M—3) 
72 (N —2)(N-—3) (N- 
(M — 1) (M — 2) (M-— 3) 4) (M —1) (M — 2) (M— 3) (M—-4)(M—5) 
+ 


aL (W —1) +5) 


3 (N —2)(N—3)(5N*-42N +95) 
(= 2) (M = 3) (M— 2) (M—3) (M—4) 
90 (N — 2) (N—3) 
(M = 2) (M—3) (M — 4) —-5) 
|2 +5)- 4N + 5) 
2 (N—2)(N-—8)(3N?—30N +65) 
+ 2) (M—3) + — 2) (M— 3) — 4) 
40 (NW — 2) (N—3) (W-4) (W- 
(M — 2) (M — 3) (M— 
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3 (N —2) (N—3)(N?-6N +15) 
+ —3N?+9N -15) + 


2) (NW — 3) — 4) (W—5) 


(M — 3) (M—4) (M — 3) (M— 4) (M—5) 
Transferring the origin to the mean of this distribution of o* of samples 
by .M,=.M, —3.,M,..M; —(.M,’)*, we have, using Neyman’s results (1) and (2): 
2 (N — 2) (N —3) (N*—15N +30) 6 (NW —2) (NV — 3) (3N*—30N + 65) 
(M—1)(M—2) (M—1) (M—2) (M-83) 


72 (N —2)(N—3) (N—4) —5) 


(M—1) (M—2)(M—3)(M—4) * (M—1)(M—2)(M—3)(M—4) 


(NW 4N (N-1)(N-2)(N-8) 
(NW — 2) (N—8) (5N*— 42 + 95) 6N (N-1)(N—2) —3) 
* (i — 2) (M— 3) ~ (Mf — 2) (M—3) 


(M—2)(M—3)(M—4) (if—2) (Mf —3) 4 


18 (NV — 2) (N —3)(N—4) (N—5) 


2Mp, 6 (WN —2) (N?- 
| (ay —6N +5)- 7. 
2) (8N*—30N'+ 65) 12 (N— 2) (N — 3) (NW 4) (N-5) 
2) (M—3) (M — 2) (M—3) (M—4) 


20 (NV — 2) (W — 3) (W — 4) (W—5) 
(M — 2) (M— 3) (M—4) (M—5) | 
1)(W2?-4N+9)_ (N-2)(N*-3N?+9N-15) 2N*(N—1) 


~M-1 M-1 M-2 (M—1) 
6N +15) 
(M—1)(M—2) (M-2)(M-3) 
9N 9(N—2)(N-3)(N-4) (N—-5) 
(M- 4) — 2)(M-—3) (M — 2) (M — 8) (M—4) 
_ 15 (N- 2) (N-3) (N -4) (3) 
Again 
M! »_ (XY) 
N3 Ne 
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Using again the appropriate expansions from those at the commencement of 
this portion of the work and from those in Biometrika, Vol. xvu. p. 80, we have, 
after simplification : 


M! 


+S (X,X,) 8N* +24N? — 24N + 8] 
+ — + 28] 
+8(XX,X,)[— + 72.N?—120N + 56] 
+8 [— 24.N* + 72N? — 104N + 56] 
+S (XpX2X,)[— + 120N2— 264N +168] 
[1442 —480N + 336] 
+S [(6N4— 24N* + 84N2—120N + 70] 
+8 + 192N2— 480N + 420] 
+S [— 24N* + 168N? — 360N + 280] 
+S(XX2X,X,) [— 24N* + 216.N? — 840N + 840] 
+9 (X4X,X,X,X,) [144N? — 1440 + 1680] 
[— 48? + 240N2— 560N + 560] 
+8 (XXPZX2X,) [— 48.N* + 336N?— 1200N + 1680] 
+ S(XPX3X,X,) [288N? — 960N + 1120] 
+(XpX2X,X,X,) [2882 — 1920N +3360] 
+9 (X)X,X,X,X,X,) [— 2880N + 6720] 
+S 2) [24N*— 96N* + 1440N + 2520] 
+9 (XpXeX2X,X,) + 432N2— 2160N + 5040] 
+S(X2X2X,X,X,X,) [288N? — 2880N + 10080] 
+9 (X;X,X,X,X,X,X,) [— 2880N + 20160] 
+9 (X,X,X,X,X,X,X,X,) . 40320}. 
Again, since ... = S ... X,), where 


S is now the summation over the sampled population X,X,... X ,, we have 


+ (— 8N* + 24N? — 24N + 8) | M— 2 

+ (4N*— 16N* + 48N?— 64N + 28) |M — 2 

+9 (XX,X,) —2) (-8N*+72N?— 120N + 56) |M-3 

+S (—24N* + 72N*— 104N + 56) |M —2 

+8 (X~X2X,) (NW — 2) (— 24N* + 120N* — 264N + 168) |M—3 


| 
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+ S(X~X.X,X,) (W—2) (N—8) (144.N* — 480N + 336) |M— 4 
2) (12N*— 48N* + 192N?—480N + 420) |M—3 
+8 (XX,X,) (N —2) (— 24N* + 168N? — 360N + 280) |M- 3 
+8 (NW —2) 8) (—24N'* + 216N*— 840N + 840) |M — 4 
+S (XX,X,X,X,) (NW — 2) (N—3) (NW —4) (144 — 1440N + 1680) |M—5 
(N —2) + 560N + 560) |M—8 
+ 8(XX2X2X,) (N — 2) (W — 8) (— 48N* + 336N*— 1200N + 1680) |M — 4 
+8(X"XPX,X,) — 2) (N—8) (288N* — 960N + 1120) |M— 4 
+9(X"X/X,X,X,) (N —2) (NW — 8) (W — 4) (288N* — 1920N + 3360) |M — 5 
+S (XX,X,X,X,X,) (N—2)(N-3)(N—4) (N—5) 2880N +6720) |M- 6 
+9 (X2X2X,X (N— 2) (NW —3) (24N*—96N? + 
—1440N + 2520) |M— 4 
+S(XpXeX2X,X,) —2) (NW —3) —4) (— 48N* + 
— 2160N + 5040) |M—5 
(N —2) (NW —3)(N—4)(N—5) 
(288? — + 10080) — 6 
+S(X2X,X,X,X,X,X;) (N—2) (NW —8)(N—4) (NW —5)(N —6) 
(— 2880N + 20160) |M—7 
+9 (X,X,X,X,X,X,X,X,) (N —2) (NW — 3) (W — 4) (N—5) (N —6) (N—7) 
. 40320 |M — 8}. 


Now the origin being the mean of the sampled population, we have as before 
S (X,)=0, and thus we have the equations : 


= 8 

0 S(X), 
= (X,°) S (X2), 

0 =S(X,) (X)}, 
= S(X,°) S(X,), 

0 

0 =S(X,) {S (X,)}, 

= {Ss (X,‘)}, 

0 =S(X,) (XY) S(X), 
= 8 (X,4) 

0 {8 

0 =S8(X,‘) 


th 
i 
64 
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= {8S (X,°)}? S (XY), 
0 = (X1)}?, 
0 =8(X,*) {S (X2)P S(X,), 
0 S(X,°) {8 (X,)}', 
0 {S (X,)}', 
{s (X,*)}4, 
0 ={s {S (X,)}, 
0 = (8S (XP)? 
0 =8 {S(X,)}% 
0 ={S (X,)}* 

Expanding the right-hand sides of these equations by the appropriate ex- 
pansions, we obtain 22 linear equations to express the 22 sums S (X,'), S (X,X,), 
S(X°X2), S(XSX,X;), S(XPX3), S(XPXLX;), S(XPXX,X,), 
S(XPXPXYX,), S(XPXFX,X,), S 
and S (X,X,X,X,X,X,X,X,) in terms of the moments d,, D2, DED, 
and 7,‘ of the sampled population. 


: Solving these equations we obtain, after considerable manipulation : 
S(X,°) = 
S (X,' X,) = — Mi,, 
S (X\°X,°) = — 
28 (X,°X,X;) = 2Mp, — Mv, 72, 
S = — 
S(XX2X;) = — — M*7,7;, 
6S (X\°X,X,X,) = + 2M*d,0, — 
28 = 2Mv, — — + 
S(XtX3X;,) = 2Mv,— — 
28 (X'X2X,X,) = + + 2M — — 
24,9 (XtX,X,X,X,) = 24Mp, — 12M*7,0, — 8M*0,0, — 6M*02 + 02, 
28 = 2Mp, — — + 02, 
28 (X°X2X2X,) = + 4M*d,0, + — M* 
— 2M*p20, — 
48 (XX )X,X,) = + + 2M*0?2 — — 
6S (X3XZX,X,X;) = — 12M*d,0, — 14M*7,0, — 
+ 39,02 + 


Biometrika 25 
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1208 (X°X,X,X,X,X,) = + + 64M?,d, + 
— —20M*020, 
24S 2) = + 3M*d2 + — 6 M,, 
128 = 24Mp, — — 12M — 6 M202 
+ + — 
48S = + 64M + 33.M 0,02 
— 28M*p20, + 3M*7,4 — 120 
7208 (X2X,X,X,X;,X,X;) = 180M*y,5,2 + 160M*5,2,— 
— — — 480M?7,0, — 720M>,, 
403208 (X,X,X,X,X,;X,X,X;) = + + 
+ 1260M*p2 — — 1120 + 
Substituting in the last expression for ,M/, we have 
(AN! — 48N*— 640 + 28) 


Ds) 


(N —2) (—4N*+36N?—60N +28)... 
+ (M 1) (M 2) (2, 
(— 24.N* + 72N*—104N +56) 
Wri — v5) 
(W 2) (—24N* + 264N +168) (9, 
(M—1)(M—2) 
(NW — 2) (N —3) (24N?— 80N +56) 
(af — 1) 2) (M —3) 
(3N*—12N*+ 42N?— GON 435) 
M-1 
(N — 2) 4210). 
(20, — — + 
(N —2) (— 24N* +4 168N?—360N + 280) 
— 1) — 2) 
(N — 2) (N—3) (—12.N* + 108N*— 420N +420) 
(M—1) (Mf — 2) —3) 
+ — 65,) 


oM, 


(Mb2 


(25, — — 


(N — 2) (N—8)(N—4)(6N?—60N +70)... 
\ : 240, — 12 — 
6Mo2+ 3M5,52) 
(N — 2) (— 24.N* + 120N? — 280N + 280) — — 


(N — 2) (N —3) ( 24.N? + 168N 600NV + 840) + 45,5, 


(M— 1) (M— 2) (M—3) 
+ — — 2M?020, — 


. 
ae 
RS 

By 

} 

ay 

| 
q 
1 

| 
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(N — 2) (NW —8) (72N*~240N +280), 
+ (M —1) — 2) (M—3) + 


— M*525,— 


(NW —2) (N — 3) (N—4) (48N*—320N +560) 
—14M5,5,— + 3M5,52 + 
—2) (N—3) (W—4) (N—5) (— 24 +56) 
(1-1) (M—2) (M—3)(M—4) (M—5) (120% 
+ 0, + 30Mb2 — 15.M?9,0,2 — 


(N — 2) (N —3) (N+—4N? + 18N?— 60N + 105) 
+ (M—1) (M—2)(M—3) + 
M3 = 60s) 
Ph, 2) (NW —3) (NW — 4) (— + 36 N? — 180N + 420) 
— 6 Mp? + + 6.M?0,20, — 
—2)(N-—8)(N -4) (NV — 5) (6N* ~GON + 200) 
(M —1) (M—2)(M—3)(M—4)(M—-5) 
+ + 830Mb2 — + — 1205,) 
(7 2)(N- 3) (W — 4) (W —5) (N -6) 
(M 2) (M —3) (M— 4) (M— 5) 
+ 160M?p,20, — 15M*p,4 — 180 Mp2 — 384.Mp,0, — — 7205,) 
(N — 2) (NW —8) (N —4) (NW —5) (N—-6)(N—-7) 
(M—1) (M— 2) (M—3) (M —4) (M —5) (M—6) (M—7) 
(218407, + 3560.Mp,0, + 2688.Mp,p, + 1260 Mp2 — 


— 1120.M2525, + 105 


and hence 
N-1(. , + 186N? — 252N +127 
4 (N — 2) (3N*—50N* + 270.N? — 602N + 483) 
(M —1) (M- 2) 
6 (N — 2) (NW —3) (N*—40.N* + 390.N* — 1400N + 1701) 


(M — 1) (M—2) (M-3) 
— 4) (2N*— 45.N? + 280.N — 525) 
(M — 2) (M — 3) (M—4) 
240 (W — 2) 4) (N — 5) (3N?—18N +77) 
 (M=—1) (M-2) (M—8) (M—4) (M—5) 
2880 (NW — 2) (N—3) (NW — 4) (NV —5) —6) (WN - 7) 
(M — 1) (M— 2) (M— 3) (M—4) (M—5) (-6) 
21840 (W — 2) (W -3) (NW 4) - 


_ 48 (N — 2) 


(M—1) (M-2 2) 4) (M—5) (M—6) (M-7) 
25—2 


x 
i 
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M — 


(N + 224) 
M-2 

(N= 2) 41? + 1155N + 1477) 
(i= 2) 

2(N—2) (N38) (W—4) (10N*—17 LW? + 1020 — 1995) 


(M — 2) (M—3) (M—4) 
6 (N — 2) (N —3) (NW — 4) (N —5) (21N?—270N + 875) 
(M — 2) (M — 3) (M —4) (M —5) 

480 (W — 2) (N —3) (NW 4) (NW —5) (NW -6) (N -7) 
(M — 2) (M —3) (M — 4) (M—5) (M 
840 — 2) (N —3) (N — 4) (NW —5) (NW —6) (NV —7) 
(M — 2) (M —3) (M — 4) (M—5) (M —6) 


(N—12+6)— 


2 (N — 2) (6N*—33N? + 74N — 63) 


(W —8) (15N*— + 679) 
(iM — 2) (M—3) 


— 2) (N — 8) (NW — 4) (6N* — 144.N? + — 1680) 
— 2) (M—3) (M — 4) 


_ 16 (N —2) —3) (N — 4) (W —5) (8N*— 42.N + 133) 
(M — 2) (M — 3) (M—4) (M—5) 


_ 192 (N —2) (NW —3) (N —4) (NW —5) (NW -6)(N -7) 
(M —2)(M—3)(M—4)(M—5)(M-6) 


336 (N — 2) (N —3)(N—4)(N—5)(N 


(M—2) (M —3) — 4) (M — 5) (M— 6) (M—7) 


Mi? 


+ 


1)'+24(N —1)?+8 
_ 2(N — 2) (3N*— 24N* + 132N? — 300N + 245) 
M-2 
, (W (3N*— + 582N*— 2100N + 2555) 
(M — 2) (M—3) 
, 12 (NW —2) (NW —3) (W —4) + 280N — 525) 
(M — 2) (M—3) (M—4) 
60 (W —2) (W 4) (W —5) (8N* 42N +133) 
(M—2) (M—8) 4) (M—5) 
720 (N —2)(N —3)(N—4)(N—5) (N—6) (N—7) 


(Mf — 2) (M —3) (M — 4) (M—5) (M—6) 


1260 (W — 2) (VN —3) (NW — 4) (V—5) (NV (V—7) 
(M — 2) (M—3) (M—4) (M—5) (IM —6) (M—7) | 


3 
ve 
+ 
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6M? (N — 2) 0,02 
| + | +10(N—1)?—16 (N-1)+8 


— 3) (W*—10N* + 64.N® — 230N +315) 


(N — 3) (W — 4) (6N?— 81N? + 460 — 945) 
(M1 —8) — 4) 


_ (W-3)(N — 4) (W —5) (33N*— 390N + 1295) 
(M — 3) (M — 4) (M—5) 
_ 120(N —3) (N 4) (W 5) (N —6) (N—7) 
(M — 3) (M — 4) (M— 5) 


210 (N —3) (NW — 4) (N -5)(N —6)(N -7) 
(iM — 3) (M—4) (M—5) (M—6) (M—7) | 


8M* (WV — 2) 3 2 
~ (M—1) (M — 2) [sav-1) —6(N —1)'+14(N-1)-12 


_ (NW —3) (6N* — + — 245) 


M-3 
(N —3) (NW —4) (3N* — 57N? + 335 — 665) 
(N —3) (N — 4) (N —5) (21N? — 270N + 875) 
(i — 3) (Mf — 4) —5) 
80 (N —3) (NW —4)(N —5) (N —6) —7) 


140 (NW —3) (NW — 4) (N — 5) (NW —6) (NW -7) 
(M —3) (M — 4) (M —5) (M—6) (M—-7) | 


Ms (N —2)(N 
+ —% Ul —3) + 12(N —1)-32(N —1) + 60 


4(N — 4) (N*—9N?+45N — 105) 
M-—-4 
9 (N — 4) (N —5) (2N? —20N +70) 
(MI — 4) —5) 
60 (N — 4) (NW —5) (N —6) (N —7) 
(MM — 4) (M —5) (M — 6) 
105 (N — 4) (W —5) (N —6) 
(M—4)(M—5)(M—6) (M—1) 


Transferring to the mean of this distribution of o? of samples as origin, by 


— 4..My. —6 (ol)? — 


| 
| 
+ 
| 
+ 
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we have 
N—1 (_ 7 — GON? + 186.N? — 252.N + 127 
2) (3N* — 50N* + 270.N* — 602.N + 483) 
(M —1) (M — 2) 


_ 6(N —2) —3) (N*— 40N* + 1400N + 1701) 
(M—1)(M—2) (M -3) 
48 (N — 2) (N 3) (N—4) (2N? — 45? + 280N — 525) 
240 (N — 2) (N —3) (N — 4) (N—5) (3N?—18N +77) 
(M —1) (M — 2) (M—3) — 4) (M—5) 
2880 (N — 2) (NW —3)(N 4)(N —5)(N -6) 


(M—1)(M — 2)(M—3)(M 4)(M—5)(M- 
21840 (NW — 2)(N —3)(N —4)(N—5)(N-6)(N— 


* (il —1) —2) (M— 8) (M— 4) (M—5) (M— 6) (M—7) 


N (WN —1) — 21N* + 45N — 31) 
M-1 


M-1 


(N — 2) (8N* — +117N* — 217N + 224) 
+ 


2N (W (W 2) (W-3) (N* + 30) 
(M— 1) 2) 


_(N — 2) (N — 41.N* + 321. N*— + 1477) 
(M—2)(M-3) 


, ON (N —1)(N - 2) (W—8) (3N*— 30N + 65) 


(M — 1) (M —2) (M—-3) 
2(N —2)(N —3) (N — 4) (10N*—171 N* + 1020.N —1995) 
(M —2) (M—3) (M —4) 
72N (N —1) (N —2)(N —3) (NW —4) (W -5) 
(M — 1) (M—2) (M—3) (M—4) 
6 (N — 2) (NW —3)(N — 4) (N—5) (21N* — 270N + 875) 
(M — 2) (M—8) (M— 4) (M—-5) 


120N (N —1) (N 2) (WN —3) (N—4) (W -5) 

(M —1) (M —2) (M —3) (A —4) (M—5) 

_ 480 — 2) (NW -3) (NW — 4) (W—5) (N 6) (N—-7) 
(M — 2) 


_ 840 (W 2) (N —3) (N - | 


(M — 2) (M —3) (M — 4) —5) (M — 6) (M—7) 


| 

a 
| 
i 
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x 
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_ 2 (N — 2) (6N* —33N? + 74N — 63) 
| 1) — 124 4) — 
— 2) (M—3) 
_ (W-2) —3) (W 4) 144.N* + 890 — 1680) 
(M — 2) (M—3) (M—4) 


16 (N — 2) (N —3) (NW — 4) —5) (3N? — 42 + 138) 
(M3) (M — 3) (M —4) (M—5) 
192 (NW — 2) (N —8) (NW — 4) (N —5) (WN —6) (N -7) 
 (M —2) (M 3) (M— 4) (M —5) 
336 — 2) (N —3) (W — 4) (N —5) (N - 6) (N -7) 
(M — 2) (M — 3) (M— 4) (M—5) | 
Mo? 


4 9 2 
1)*+ 24(N-1)7+8 


2 (N — 2) (8N* — 24.N* + 132N? — 300N + 245) 
(N — 2) —3) (8N*— GON? + 582.N2— 2100N + 2555) 
(M — 2) (M 3) 
12 (N — 2) (N —3) (N — 4) (2N* — 45.N? + 280N — 525) 
(M— 2) (M—3)(9-4) 3 
60 (NV — 2) (W —3) — 4) (W —5) (3N? — 42. 4 133) 
(M — 2) (M — 3) (M — 4) (M—5) 
720 (N — 2) (N —3) (N — 4) (W —5) (N -6) (V-7) 
(M —2) (M —8) (M — 4) (M—5)(M-6) 
1260 — 2) (N —3) (NW — 4) (VY —5) (NW —6) 
(M — 2) (M —3) (M — 4) (M—5) (M —6) (M -7) 


(V — —3N +10) 


M-1 
(WN — 2) (N+ 4.N* +16N* — 40N +35) 
N?(N —1)?(N?- 6N +7) 
2N (N —1)(N —2) (N?—8N? +31N — 40) 
(M2) 


(N — 2) (N — 3) 10N* + 64.N? — 230.N +315) 


(M — 2) (M—3) 


4.N?(N —1)?(N — 2) (N —3) 
(M—1)? — 2) 


383 
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2N (N —1) (N — 2) (NW —3) (5N? — 42N + 95) 
(M—1) (M — 2) (M -3) 
(NW — 2) (N —3) (NW —4) (6N* —81N? + 460N — 945) 
(M-2)(M-3)(M-4) 

6.N? (N —1)?(N — 2) (N -3) 

(M —1)? (M — 2) (M - 3) 
(N — 1) (N —2) (N —3) (N -—4) (N—5) 
(M — 1) (M— 2) (M—3) (1-4) 
N —2)(N —3) (N 4) (WN —5) (83N?—390N + 1295) 
(M — 2) (M -3) (M — 4) (M—5) 

60.N (NW —1)(N —2) (N 3) —4) (N—5) 
~~ (M — 2) (M— 4) (M —5) 

120 (NW — 2) (NW 3) (NW — 4) (N —5) (N —6) (N —7) 

(M — 2) (M — 3) (M — 4) (M—5) (1-6) 
210 (N —2)(N —3) —4) (NW —5) (N —6) (N —7) 
(M — 2) (M —3) (M — 4) (M —5) (M —6) (M—7) | 


+ 


—1) (3N® — GN +5) 


N ( 
M-1 


A 


M-1 

_ — 2) (8N*—15.N? + 35.N — 35) 
M-2 

6.N (N —1) (N —2) (N?—4N +5) 
(M —1) (M— 2) 

(N —2) (N —3) (6N? — 51.N? + 180.N — 245) 
—2) (M-3) 
N (N-—-1) (N -2) (N —3) (3N? —30N + 65) 

(M— 1) (M — 2) (M -3) 
(N — 2) (N —3) (NW — 4) — 57.N? + 335. — 665) 
(M — 2) (M—3) (M— 4) 
12N (NW —1)(N —2) (N —3) (N —4) (WN - 5) 
(M — 1) (M — 2) (M —3) (M —4) 
(N — 2) (N —3) (NW — 4) (W —5) — 270N + 875) 
(M — 2) (M —3) (M— 4) (M—5) 

20.N (N —1)(N —2) (N —3) (NW — 4) 

(M —1) (M — 2) (M—3) (M- 4)(M-—5) 
_ 80(N —2) (NW —3) (NW — 4) (VN —5) (NV -6) (V-7) 

(M — 2) (M — 3) (M— 4) (M—5) (M —6) 

_ 140 (NW —2) (W —3) (W -—4) (N —5) -6) 


(M — 2) (M—3) (M — 4) (M— 5) (M—6) (M—-7) 


4 
= 
é 
» 
4 
ot 
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Mos (N —1)?(N?—3N +6) 


M-1 (M— 1? 
(N —1)(N— 2) (N?—3N? + 9N — 15) 
(M— 1) (M —2) 
2) (NW + 18N?— 60N + 105) 
(M — 2) (M—3) 
3N3(N—1)) 12N2(N—1)?(N—2)(N—3) 
(M—1:(M—2) 


12.N (N —1) (N— 2) (N—3) (N?—6N +15) 
(M — 1) (M — 2) (M— 3) 
4 (N —2) (N —8) (N —4) 9N? + 45N — 105) 
(M — 2) (M—3) (M=4) 
18N?(N —1)?(N —2) (N —3) 
(M—1) (M — 2) (M—3) 
36N (N —1) (N —2) (N —3) (N 4) (N—5) 
=1) (M—2) (M—3) (M-4) 
9 (N — 2) (N —3) (N —4) (N —5) (2N2—20N + 70) 
(M — 2) (M—3) (M—4) (M — 5) 
60N (N — 1) (N —2) (N —3) (N —4) (N —5) 
~~ (M — 2) 3) (M — 4) (M-5) 
60 (N —2) (N —3) (N — 4) (N—5) (N —6) (N—7) 
(M— 2) (M—3) (M—4) (M — 5) (M-6) 
(M — 2) (M—3) (M — 4) (M—5) (M- 6) (M—7) 


The Tchouproff formulae (1), (2), (3) and (4) of Part III are particular cases 
of formulae (1), (2), (3) and (4) of this part and are deduced by putting M infinite. 

The great length of these formulae (3) and (4) is of interest from several points 
of view. The method employed to obtain them can be employed to obtain the 
higher moments .M,, .M,, etc. of the Distribution of o? of Samples, and, indeed, 
any moment of a distribution of a higher moment of the sample, for instance 
the moments of the distribution of the fourth moments of samples. It is however 
immediately apparent when the problem is considered, that the algebra involved 
would be extremely laborious and the results obtained of too great length to be of 
any value. It is thus evident that the expression of the moments of various 
distributions, of the nature considered here, in terms of the moments (or betas) 
of the sampled population, although quite possible, leads to unduly complex results 
and some alternative method of expression is required. 

To return to ,M,, .M,, .M,; and .M,, the size of the last two formulae, together 
with the type of term involved, means that ,M, is in practice impossible of caleu- 
lation, even with a machine, when M is over 40 or 50; consequently any sampled 
population of fair size must be treated as infinite and Tchouproff’s formulae 
used even though the assumption be not really legitimate and thus the results 
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obtained only approximate. The labour involved, even when this assumption is 
made, is moreover still very considerable. 

Practical considerations thus compel us, when a representation of the Distri- 
bution of o* of Samples by a Pearson Curve is required, to use Tchouproft’s 
formulae when the population is anything except very small. When it is very 
small however the formulae given above may be used and a Pearson Curve 
obtained to represent the distribution. The possibility of such a representation 
will now be considered. 

The Distribution of o? of Samples of 10 from Population (C) of total frequency 
36, together with its representation by a Pearson Curve. 


Now the types of samples to be obtained from (C), together with the o of each 
type of sample, are: 


Type of Sample Frequency | o 
| 
0A OB 100 3,268,760 | 
04 1B 9C | 20,429,750 09 
OA 3B sC | 48,670,875 16 | 
OA 3B 7C | 57,684,000 | 21 
0A 4B 6C | 37,191,000 
0A 5B 50 | 13,388,760 25 | 
OA 6B | 2,656,500 24 
OA 7B 3C | 276,000 21 
OA 8B 2C | 13,500 16 | 
04 9B 1C | 250 09 | 
04 10B 0C | 1 00 | 
14 OB 9¢C 2,042,975 36 
14 1B 8¢ 10,815,750 | ‘41 | 
14 2B 21,631,500 | “44 
14 3B 6¢ 21,252,000 | 
14 4B 5¢ 11,157,300 ‘440 
14 5B 4¢ 3,187,800 | “41 
14 63 483,000 | 
14 7B 2¢ 36,000 | 2 | 
14 8B 1¢ 1125 | 2 | 
14 9B 0C 10 | 09 


Thus dividing the samples to be obtained into two groups according as they do 
or do not contain the element of the first category of (C), we have the two distri- 
butions of o* of samples as follows : 


Grovp I Grovp II 

— 

| 

Value of o* | Frequency Value of o? Frequency | 

} “09 10 ‘00 3,268,761 | 

| 20 | 1,125 09 20,430,000 
36,000 16 48,684,375 
| $6 2,525,975 21 57,960,000 

‘41 “2h 39,847,500 

‘4h 32,788,800 "25 13,388,760 | 

“4b | 21,252,000 | 
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Adding these we have for the complete Distribution of o* of Samples of 10 
from Population (C) : 


Value of o? | Frequency 


“00 3,268,761 


09 20,430,010 
“16 48,684,375 
-20 1,125 
“21 57,960,000 
2 39,847,500 
13,388,760 
+29 36,000 
| 2,525,975 
‘41 | 
| | 32,788,800 
| 45 21,252,000 


| Total Frequency = 254,186,856 


The constants of this distribution are: 


Working origin = ‘24. Working unit, the squared unit of (C). 


= 017,142,86, M, = 014,274,0, 

= ‘014,567,91, = :000,801,8, 

vy = ‘001,540,90, = :000,428,2, 
= -000,508,45, 


Mean = 0°257,143, = 0°119,474, 
2B, =0°221,053, 2B, = 2°101,626. 

A consideration of the two groups of o? of samples above and a comparison of 
them with the corresponding groups of the Distribution of Means of Samples from 
(C) given in Part IV illustrate clearly a number of important points that arise in 
sampling from a finite population, 

If the samples be divided into groups by the consideration of the numbers of 
individuals from certain categories of the sampled population which are to enter 
into the sample, these groups will lead to a number of sub-frequency distributions 
of means, or o* of samples as the case may be, which in turn combine into the 
complete distributions of those constants. 

The very nature or definition of a mean ensures that in the case of the distri- 
bution of means of samples, these sub-frequency distributions will be moderately 
skew and uni-modal, whilst the modes of consecutive sub-frequencies will occur at 
comparatively small intervals of the range of the mean and the ranges of con- 
secutive sub-frequencies will overlap except for small intervals at their ends. 
Thus the complete Distribution of Means of Samples of N will approximate quite 
closely to normality even when the number of categories in the sampled population 
is small and therefore the number of sub-frequencies of means small likewise. 
In fact for all sampled populations small or large the Distribution of Means of 
Samples of NV, where N is not large, will be strikingly normal in character. This 
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has been shown in the previous parts of this paper and has been well known 
before, at least for samples from large populations. 

When we turn to the Distribution of o* of Samples of N, where N is not large, 
matters are quite different. The value of o* measures the dispersion of the 
individuals of a sample from its mean, thus in any group of samples as defined 
above there is a tendency for many values of o* to occur twice causing the corre- 
sponding sub-frequency of o? of samples to be much more skew than in the case of 
the means of samples, although it is still uni-modal. However, this also causes the 
modes and ranges of consecutive sub-frequencies to differ comparatively widely; 
and thus even when the sampled population is large with many categories the 
Distribution of o? of Samples is still far from normal, being in general quite 
distinctly skew. When however the sampled population is very small with but a 
few categories, these wide differences between the modes and ranges of the sub- 
frequencies of o* of samples cause the Distribution of o? of Samples itself to be 
bi- or tri-modal and moreover to be a discrete frequency distribution with unequal 
intervals between frequencies. As a Pearson Curve is based on the assumption of 
a uni-modal frequency, it is thus not to be expected that, in this case, a represen- 
tation by such a curve will be reasonably accurate; accordingly when these formulae 
do give the moments of the Distribution of o* of Samples with moderate ease and 
rapidity their practical use is nullified by the failure of any adequate representation. 

This failure is clearly shown in the distribution of o? of samples of 10 from (C). 
Its moment-coefficients are 

2M,’ = 0'257,143, = 0°014,274, 

2M, = 0:000,802, 2M, = 0°000,428, 
and, when the values of the first eight moment-coefficients of (C), together with 
M =36 and N=10, are substituted in formulae (1), (2), (3) and (4) of this part, 
the values obtained for ,/,’, etc. are precisely the same. Consequently the formulae 
are quite efficacious, but as the Pearson Curve derived from these moment- 
coefficient values above proves to be a J-curve, the completeness of the failure is 
at once indicated. 

We can thus sum up the considerations which arise when dealing with the 
Distribution of o* of Samples from a finite population. Owing to the unwieldy 
character of the expressions for ,M, and .M,, it is a practical impossibility, even 
with a calculating machine, to evaluate them both when M is greater than 40 or 
50. Thus, when sampling is conducted on the basis of M finite (i.e. complete 
samples are drawn) and M is of moderate size, we are compelled by sheer necessities 
of time and labour to represent the resulting Distribution of o? of Samples by 
a Pearson Curve based on the doubtful assumption that M is infinite, because 
Tchouproff’s formulae, as used in Part III, provide the only reasonable method of 
calculating the values of .M,,,M; and ,.M,. When M is very large, e.g. several 
thousands, this probably matters little as is indicated by the Distribution II (b) of 
this paper, but when M is only of moderate size, the loss of accuracy in represen- 
tation is not clearly known and may be considerable; however the assumption 
appears at present a necessity. 
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When M is quite small, these four formulae will give the moment-coefficients 
of the Distribution of o* of Samples moderately easily, but their practical use 
is liable to cease at this stage. As illustrated here the Distribution of o* of 
Samples, when M is quite small, may be multi-modal, and moreover is almost 
certain to be a distribution with discrete frequency groups at unequal intervals. 


Taking the first of these difficulties, the plurality of modes, this may at first 
sight be overcome by developing a set of frequency curves of the Pearson type 
with more modes than one or by using curves based upon some method such 
as Thiele’s semi-invariants. Now if methods like the latter are to produce multi- 
modal curves, it appears to me, although I have no practical experience in this 
field, that more constants than the apparently usual number of four would be 
required, for these constants are after all equivalent to moments in their essentials. 
The formula for ,M, is severe enough and it is easily seen that formulae for any 
other higher moment-coefficients or combinations of such moment-coefficients 
would be impossibly severe; thus this method would not obviate the difficulty 
of the evaluation of the necessary constants and would fail in practice at any rate 
on this account. It might be possible to develop a quasi-Pearson Curve with two or 
more modes whose constants only involve the first four moment-coefficients of the 
distribution, but, even if developed, it is a very open question as to whether it 
would be efficacious in the representation of actual distributions of this type, and 


until the formulae can be simplified, the use of higher moment-coefticients is not 
practicable. 


The second of the difficulties, that is, the frequency groups becoming discrete 
and at unequal intervals, presents a problem when the representation of the distri- 
bution is required, which is also very difficult and for which, at present, no one 
appears to have obtained any satisfactory solution. 


Enough has been said to emphasise the fact that the problem of the repre- 
sentation of Distributions of o? of Samples from a quite small finite population is 


one which will call for a large amount of research before any practical advance 
towards its solution is likely to be made. 


In conclusion a comparison is given between the Distribution of o? of Samples 
of 10 from Population (C) and the corresponding Pearson Curve obtained from 
the values of ,.M,, .M; and .M, given by the formulae of this part, not because it 
can be expected that a Pearson Curve will represent the distribution successfully, 
for a Pearson Curve is designedly uni-modal, but merely to emphasise the unusual 
character of the distribution when M is very small and also the need for some 
other method of representation. 


For this attempted representation of the Distribution of o* of Samples of 10 
from (C) we have seen that the constants of the curve are to be: 


Mean = 0°257,143, oM, = 0°014,274, 


| 
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The curve is a J-curve of Type I, hence with the usual notation 
r = 2°147,820, e = 1:072,312, 
— 2:147,820 m’ + 1:072,312 =0. 
The last equation has roots 1°358,463 and 0°789,356, and as ,M, is positive 
m= 0°789,356, m, =1°358,463; 
therefore m, = — 0'210,644, Ms = 0°358,463. 
Taking as unit ;4, the squared unit of (C) 
b = 439659, a, = — 62°6514, a, = 106°6173. 
Taking the total frequency to be 1000, y, = 29°6752, and the curve is 
—0°210,644 0°358,463 
y= 206752 1) 
The Origin = Mode = 25°7143 — 78'8095 = — 53-0952. Hence, in units of Popu- 
lation (C), the range is from o?=*096 to o?='535. 


Keeping to the unit of the curve, the ordinates of the curve, which is shown in 
Fig. VIII, are 


Abscissa Ordinate Abscissa | Ordinate 
= 
63 642940 90 | 181488 

66 38°9144 95 15°4086 

70 31°7737 100 | 12°2180 
75 27 °2467 103 9°6807 
80 23°8151 105 7°1806 | 
85 20°8089 106 °62 0:0000 


Now Distribution VIII (a) indicated that, when marbles were used, the Distri- 
bution of Means of 1000 Samples of 10 obtained from (C) by actual sampling 
agreed well with the true Distribution of Means of Samples. Consequently the 
corresponding Distribution of o? of these 1000 Samples was obtained. 


VIII (b). The o of 1000 Samples of 10 from Population (C), the sampling 
being conducted by the use of coloured marbles. 


Value of o? | | :09 | 16 | | -21 | -24 | | -29 | | -41 | 


Qu 


Frequency 15 | 76 | 188] O | 214 | 171 | 56 


The constants are : 
Working origin = ‘24. Working unit, the squared unit of (C). 
= '018,820,0, Mean = 0°258,820,0, = 0°120,080, 
vy = 014,773,4, =0°014,419,2, 2B, = 0°194,083, 
v; =°001,583,6, =0°000,762,8, 2B, = 2°108,627, 
= 000,526,3, 2M, =0°000,438,2. 
Note. In the representation of this distribution in Fig. VIII the unit of o* 
is z}5 the squared unit of (C). 


: 
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FIG: WI. 


HISTOGRAM OF 
DISTRIBUTION MII b. 
FREQUENCY 4—[] 


— — — MEAN. 


| 


fa) 


2021 2425 29 36 41 4445 53-5 

This is a discrete distribution like the true Distribution of o° of Samples of 10 
from (C) given before in this part, of which distribution it is itself a sample of 
course. If we reduce the total frequency of this true distribution of o? from 


254,186,856 to 1000, we can compare this sample with it. Carrying out this 
reduction, we have 


True Frequency 
> | Frequency in reduced by 
Value of o? Vill (0) | 1000 

254,186,856 
“00 15 | 12°86 
16 188 191°53 
*20 0 | 0-00 
"21 | 214 228°02 
"24 171 156°76 
*25 56 52°67 
*29 0 0:14 
| 10 9°94 
47 55°09 
130 | 128-99 
45 93 83°61 


| 

| 
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Applying the Goodness of Fit test, we have 
x? = 5°41, P =:9075 with 12 Groups. 


This shows that VIII (b) is an excellent sample of the true Distribution of o* 


of Samples of 10 from (C), and consequently an attempt has been made to compare 
VIII (b) with the J-curve. 


Now VIII (5) is a discrete distribution with unequal intervals, so the frequency 
of any value of o? was represented by a rectangle, whose base was the distance 
between the two points of bisection of the intervals on either side of this value 
of o*% The height of the rectangle being properly chosen the frequency of this 
value of o? is thus represented by an area, which is the first step in the comparison 
because it is areas under the J-curve that represent frequencies. However, as the 
intervals between successive values of o® in VIII(b) are unequal, this method 
causes the value of oc? considered not to be on the mid-ordinate of the rectangle 
whose area represents the frequency of the value. Thus while the representation 
is purely approximate and pictorial, it suffices to show that the J-curve in no way 
represents the Distribution of o? of Samples of 10 from (C). 


Summary of Conclusions reached. 


(a) The Distribution of Means of Small Samples from an infinite Population. 


(1) Even when the sampled population is quite skew, this distribution exhibits 
a surprising tendency to normality. Hence its approximate representation by a 
normal curve will be quite good in general and the assumption that, for small 
samples as well as large, the distribution of means from a quite skew infinite 
population is normal will be quite useful in practical statistics. 


(2) By the aid of moment-coefficients a good representation of this distribution 
by a Pearson Curve can be obtained showing quite well the skewness, small as this 
is in general. 


(3) The true curve of means of samples is known when the sampling is from a 
population represented by a Pearson Type HI Curve. 


(b) The Distribution of o? of Small Samples from an infinite Population. 


(1) Tchonproff’s corrected formulae for the moment-coefficients of this distri- 
bution will lead, with the aid of a Pearson Curve, to a good representation of the 
distribution in practical cases. 


(2) The labour involved in the calculation of the higher moment-coefticients 
of the sampled population, necessitated by the exact use of Tchouproff’s formulae, 
can be obviated by the assumption that a Pearson Curve will represent the popu- 
lation, which enables approximate values for these higher moment-coefficients to 
be calculated rapidly by the well-known difference formulae. This approximation 
does not appear to affect the accuracy of the representation of this distribution of 
o” of samples to any appreciable extent. 
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(c) The Distribution of Means of Small Samples from a finite Population. 


(1) Here again, even when the sampled population is very skew and as small 
as 36, the Distribution of Means of Samples still shows a very strong tendency to 
normality. In fact, it appears that the assumption of the normality of this distri- 
bution is legitimate for practical purposes, no matter what the size of the sample 
or the sampled population, and no matter upon which basis the sampling is con- 


ducted. It is only likely to fail as a practical hypothesis if the sampled population 
follows a really extreme form of skewness. 


(2) This distribution can be represented satisfactorily by a Pearson Curve 
based on the formulae for its moment-coefficients given in Part IV. 


(3) Subject to conditions that have been stated broadly, it has been possible to 
show that this distribution can be represented approximately by a Pearson Curve 
based upon the values of its moment-coefficients which result from the assumption 
that the sampled population is infinite in all its categories. That is to say, it 
has been possible to indicate the conditions under which, in dealing with the 
Distribution of Means of Samples, it is immaterial whether the sampling be con- 


ducted by drawing each time complete samples or individuals only before replace- 
ments. 


' (d) The Distribution of o? of Small Samples from a finite Population. 


(1) The two formulae for the 3rd and 4th moment-coefficients of this distri- 
bution have been obtained, and thus with Dr Neyman’s results the first four 
moments of the distribution are known. The great length of these two formulae 


however contributes materially to the difficulty of accurate representation of this 
distribution. 


(2) When the sampled population is large or of moderate size, the length of 
the formulae for .M; and .M,, together with their general unwieldiness, makes the 
evaluation of these constants by their aid almost an arithmetical impossibility at 
any rate in practical statistics. We are thus compelled to resort to an approxi- 
mate representation of the distribution by assuming the sampled population infinite 
and calculating approximate values for the moment-coefticients of the distribution 
by means of Tchouproff’s formulae as in Part III. This approximation is probably 
quite good when the total frequency of the sampled population is of the order of 
several thousands, but for moderate-sized sampled populations with total frequency 
several hundreds its usefulness and accuracy are certainly questionable. 


(3) When the sampled population is very small in its total frequency, the 
formulae for the moment-coefticients of the Distribution of o* of Samples give 
the values of these constants quite accurately and with moderate ease, but the 
representation by a Pearson Curve is liable to fail. Since the sampled population 
when it is small, has only a few categories, the distribution of o? may become 
multi-modal and hence, as the Pearson Curves involve the assumption of a uni-modal 
frequency, they are naturally incapable of representing the distribution; whilst 
further, not only does it tend to become one with discrete frequencies, but these 
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frequencies occur at unequal intervals, rendering its representation by any curve 
a matter of doubtful validity and accuracy. 


(e) Considerations upon the actual practice of sampling. 


When the actual sampling is conducted on the basis of an infinite sampled 
population, that is, individuals only are drawn before replacement, the use of one 
sample whose randomness is established, such as Mr L. C. Tippett’s Random 
Numbers, provides a really reliable method. Further, it is one which takes pre- 
cedence over any system of drawing objects from a bag, bow] or urn because of its 
much greater speed and ease in operation. 


When the sampling is conducted on the basis of a finite sampled population, 
that is, complete samples are drawn each time before replacement, or when a 
typical random sample is not available, the use of marked tickets appears to be 
of little or no value because of the almost insuperable difficulty of obtaining 
reaily effective shuffling between consecutive draws. If the sampled population 
be very large, ronghly homogeneous and symmetrical material such as commercial 
coloured glass beads, which provide easy means of distinguishing between the 
various categories of the sampled population, appears to be quite satisfactory in 
practice. If however the sampled population be quite small, the lack of true 
homogeneity even in such material as this will cause bias in actual sampling and, 
if good results are to be obtained, some material the individuals of which really 
approximate quite closely to the ideal of complete homogeneity must be employed. 
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TABLE OF THE RATIO: AREA TO BOUNDING ORDINATE, 
FOR ANY PORTION OF NORMAL CURVE. 


By JOHN P. MILLS. 


Introduction*. It has been recognized for some time that a table of the ratio, 
area to ordinate, for the normal curve would be useful. It was pointed out, for 
example, in Biometrika, Vol. xvi. p. 164, that the area A of the tail of a frequency 
curve could be found approximately by the formula A = ¢ (0) R., where ¢ (0) is 
the ordinate of the curve at the stump and x and R, depend on the first and 
second derivatives at that point. Particular examples of application to the point 
binomial were given in succeeding pages of that volume, and to the hypergeometric 
series in Vol. xvit. pp. 61 et seg. In all these cases the need for a tabulation of 
RK. was apparent because for certain values of # its value could not be obtained 
easily and with sufficient accuracy from existing tables of the area and the ordinate. 
The ratio R_, is also important because its reciprocal equals the mean value of the 


area of the tail, from # onwards, of the normal curve y = ge e-**. The methods by 


which this table were computed were different in different parts. For the interval, 
0 = «= 1°84, Sheppard’s well-known tables gave correct results, save in six cases, 
which were examined separately by the methods used later. For 1:84 < # S 4:20 


it was necessary to use the tables of Dr Jas. Burgess, Trans. R. S. Edinburgh, 
Vol. xxx1x. He tabulated 


If then Re=p,V2 at t=a/Vv2. 


Accordingly p was first computed, the values of € being taken from an eight place 
logarithm table, and the divisions made on a machine. For 1°300 < t = 1-500, H and ¢ 
were extracted at intervals of At =0°005, and p, and KR, computed. This gave a 
provisional table of ®., at the inconvenient points, 1°300 V2, 1:305¥V2, etc. What 
was desired was a set of values at «= 1°80, 1°81, ete. These could be found by 
interpolation from the provisional table already obtained, but to produce the 
desired accuracy easily, it was desirable first to obtain twice as many values of R., 
at intervals half as great. These were found quickly by Pearson’s 4-point mid- 
panel formula. For the interval, 1°50 < t = 2°15, sufficient accuracy could be obtained 
by first extracting ¢ at intervals of 0°01, and then halving them as before. For 
2°15 = t = 3:00, a somewhat greater simplification was possible. By computing 
several values of p; near the point ¢ = 2°20 V2 the value of R., was found by inter- 
polation at 2°20; and then, in similar manner, at 2°30, 2°40, ete. Then the 
intervening tenths of the # interval were filled in by using Pearson’s 5-point 
formula. From «= 4°25 (t=3) to the end, the values were supplied by the Editor 


* By B. H. Camp, under whose direction this table was prepared. 
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from material already available at the Galton Laboratory. Some of this part was 
also computed by the author from the formula 
1.3.5 

The entire computation was checked by differencing, and by overlapping the 
several methods. The errors introduced were investigated. Eight places of 
decimals were retained throughout, and the original computation therefore gave a 
table of R., to eight places. In no case did the possible error in the eighth place 
jeopardize the accuracy of the fifth place retained in the final table; save only in 
the six instances already mentioned near the beginning of the table, and these 
were eventually calculated by the later methods to nine places. 


(the RatioR, =e" | ede.) 


x 

Rz -& Rz -A 
“00 1°25331 993 *40 ‘93567 623 
“01 1°24338 982 “41 “92944 615 
“02 1°23356 969 *42 *92329 609 
‘03 1°22387 957 43 “91720 602 
1°21430 946 ‘91118 596 
O05 120484 934 ‘90522 590 
06 1°19550 923 46 “89932 583 
Oy 1°18627 911 “89349 577 
08 1°17716 900 48 "88772 571 
“09 1'16816 890 49 *88201 565 
‘10 1°15926 78 “87636 558 
‘11 1°15048 869 51 *87078 553 
12 1°14179 858 "86525 548 
13 1°13321 847 "85977 541 
1°12474 838 "85436 536 
°15 1°11636 827 *84900 5380 
16 1°10809 818 56 *84370 525 
1°09991 808 57 "83845 519 
18 1°09183 799 ‘58 *83326 514 
19 1°08384 790 59 *82812 509 
10754 780 *82303 504 
“21 1°06814 771 ‘61 “81799 498 
"22 1°06043 762 *62 *81301 494 
2, 105281 754 "63 “80807 488 
1°04527 745 64 “80319 484 
"26 1°03782 736 65 *79835 478 
*26 1°03046 728 66 *79357 474 
1°02318 719 “78883 469 
1°01599 712 68 “78414 465 
*29 1°00887 703 69 "77949 460 
*30 100184 696 “70 ‘77489 455 
*31 0°99488 687 *77034 451 
“82 “98801 680 “76583 446 
“98121 673 73 ‘76137 442 
‘97448 665 Th *75695 438 
35 ‘95783 657 *75257 433 
36 “96126 651 ‘76 *74824 430 
‘95475 643 “74394 425 
*38 "94832 636 *73969 421 
‘94196 629 *73548 417 
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TABLE (continued). 


“80 73131 413 1°30 56487 265 
“81 “72718 409 1°3. “56222 262 
“82 *72309 405 1°32 55960 261 
"71904 401 1°33 *D5699 258 
84 *71503 397 1°34 55441 256 
85 71106 394 1°35 55185 254 
86 70712 390 1°36 54931 252 
*70322 387 1°87 54679 249 
*88 69935 382 1°38 54430 248 
*89 69553 380 39 54182 246 
90 69173 375 1°40 53936 244 
“91 68798 373 1°41 “53692 242 
“92 68425 368 1°42 53450 240 
93 “68057 366 1°43 53210 238 
“94 67691 362 1-44 52972 237 
95 *67329 358 1°45 52735 234 
96 -66971 356 1°46 52501 233 
‘97 66615 352 147 52268 230 
‘98 66263 349 1°48 52088 229 
“99 “65914 346 1°49 51809 227 
1°00 *65568 343 1°50 *51582 226 
1°01 *65225 340 1°51 51356 223 
1:02 64885 336 1°52 51133 222 
1°03 “64549 334 1°53 50911 221 
1:04 64215 330 1°54 50690 218 
1°05 63885 328 1°55 “50472 217 
1°06 63557 325 1°56 50255 215 
1°07 63232 322 1°57 50040 214 
1:08 62910 319 1°58 49826 212 
1:09 62591 317 1°59 *49614 210 
1°10 62274 313 1°60 49404 209 
1°11 61961 311 1°61 *49195 207 
1:12 “61650 308 1°62 48988 206 
1°13 “61342 306 1°63 “48782 204 
114 “61036 303 1°64 48578 202 
1°15 60733 300 1°65 *48376 201 
1:16 60433 298 1-66 *48175 200 
1°17 “60135 295 1°67 "47975 198 
1:18 59840 292 1°68 AT777 197 
119 59548 291 1°69 47580 195 
1°20 59257 287 1°70 47385 193 
1°21 “58970 286 “47192 193 
1°22 58684 282 1°72 *46999 191 
1°23 *58402 281 1°73 46808 189 
1:2 58121 278 1°74 -46619 188 
1°25 57843 276 1°75 46431 187 
1:26 *D7567 273 1°76 46244 186 
1°27 *b7294 272 1°77 *46058 184 
1°28 “57022 268 1°78 45874 182 
1°29 56754 267 1°79 "45692 182 


¢ 
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TABLE (continued). 


Ry 


Rz 


1°80 
1°81 
1°82 
1°83 
1°84 


1°85 
1°86 
1°87 
1°88 
1°89 


1°90 
1°91 


*45510 
*45330 
“45151 
°44973 
“44797 


"44622 
“44448 
"44275 
“44104 
43934 


*43765 
43597 
“43430 
“43265 
*43100 


42937 
*42775 
“42614 
°42454 
*42295 


*42137 
“41980 
*41825 
“41670 
“41516 


°41364 
“41212 
“41062 
“40912 
“40764 


“40616 
“40470 
“40324 
“40179 
*40036 


*39893, 
*39751 
*39610 
*39470 
39331 


*39193 
*39055 
*B8919 
38783 
“38649 


*B8515 
*38382 
*38250 
38118 
37988 


*37858 


*37729 


37601 
‘37474 
*37348 


*37222 
*37097 
*36973 
*36850 
36727 


*36605 
“36484 
“36364 
36244 
*36125 


*36007 
“35889 
*B5773 
35657 
35541 


“35427 
*35313 
“35199 
°35087 
34975 


*34863 
*34753 
34643 
*34533 
"34425 


34316 
*34209 
“34102 
“33996 
*33890 


*B3785 
*33681 
*33577 
*33474 
*33371 


*33269 
33168 
*33067 
*32967 
32867 


32768 
32669 
*32571 
“32474 
*32377 


; 
398 
180 2°30 129 
179 2°31 128 
4 178 2-32 127 
176 2°33 126 
175 2°34 126 
174 2°35 125 
173 2°36 124 
171 2:37 123 
170 2°38 123 
a 169 2°39 122 
168 121 
167 2-41 120 
1°92 165 2-42 120 
1:98 165 2-43 119 
1:94 163 118 
1°95 162 O45 118 
1°96 161 246 116 
1:97 160 116 
1°98 159 2-48 116 | 
1:99 158 2-49 114 
2:00 157 2°50 114 
2-01 155 251 114 
2-02 155 2°52 112 
2-03 154 2°53 112 
2-04 152 254 112 
2°05 152 2°55 110 
2-06 150 2°56 110 
2-07 150 257 110 
2°08 148 2°58 108 
2-09 148 2-59 109 | 
146 2-60 107 
ee: 211 146 261 107 
2:12 145 262 106 
aS 2°18 143 2°63 106 
214 143 264 105 
215 142 2°65 104 | 
2°16 lal 266 104 
217 140 2-67 103 
2°18 139 268 103 
2:19 138 269 102 
2-20 138 270 101 
221 136 271 101 
2-22 136 272 100 
2°23 134 2-73 100 
2-24 134 27h 98 
2°25 133 2-75 98 
2°26 132 2°76 98 
2-27 132 277 97 
2-28 130 2-78 97 
2:29 130 2°79 97 
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TABLE (continued). 
Rez -A x 


2°80 “32280 96 3°30 -28064 
2°81 “32184 95 3°31 -27990 
2 82 “32089 95 3°32 ‘27917 
2-83 “31994 94 3°36 “27844 
284 “31900 94 3°34 27772 


2°85 31806 93 3°85 27699 
2°86 “31713 93 3°36 27627 
2°87 31620 92 3°37 27556 
2°88 *31528 92 3°38 27485 
2°89 31436 91 3°39 27414 


2°90 31345 91 3°40 27343 
2°91 *31254 90 3°41 °27273 


2°92 31164 90 3°42 *27203 
2:93 “31074 89 B43 *27134 
2-94 “30985 89 BA -27065 
2°95 *30896 88 3°45 *26996 
2°96 *30808 88 3°46 *26927 
2°97 *30720 88 *26859 
2°98 *30632 87 3-48 -26791 
2°99 *30545 86 B49 *26724 
3°00 “30459 86 3°50 *26657 
3°01 *30373 86 3°51 +26590 
3 02 *30287 85 3°52 *26523 
3°03 *30202 84 3°53 *26457 


3°04 30118 84 B54 26391 


*30034 84 3°55 26326 
29950 84 3°56 *26260 
+29866 82 8°57 *26195 
29784 83 3°58 “26131 
29701 82 3°59 *26066 


*29619 81 3°60 *26002 63 
*29538 82 3°61 *25939 64 
29456 80 362 *25875 63 
*29376 81 3°63 25812 63 
*29295 80 3°64 *25749 63 


3115 "29215 79 3°65 25686 62 
3:16 29136 79 3°66 *25624 62 
3°17 *29057 79 3°67 *25562 62 
318 ‘28978 7 3°68 *25500 61 

If *28900 78 3:69 *25439 61 


"28822 | 78 370 *25378 61 
28744 77 3°71 *25317 61 
*28667 77 B72 *25256 60 


28590 76 3°73 25196 60 
28514 76 25136 60 


*28438 
28363 
28287 
28213 
28138 


375 “25076 59 
3°76 25017 60 
3°77 *24957 59 
3:78 *24898 58 

“24840 59 


Oo 


ce 

| 

| 


74 
73 
73 
72 
73 
72 
71 
71 
71 
70 
70 
69 
69 
69 
69 . 
68 
68 
67 
67 
67 
67 
66 
66 
65 
| 66 2 
3°05 
3-06 
3°07 65 
| 
3:10 ay 
3°11 
3:14 
3°20 
3°21 
3°22 
3:23 
3°24 
8:25 
8°26 
3°27 
3°28 
3°29 
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3°80 *24781 58 6°00 "16238 253 
3°81 *24723 58 6°10 “15984 246 
3°82 "24665 58 6°20 *15739 239 
3°83 “24607 57 6°30 “15500 232 
3°84 "24550 57 6°40 "15269 225 

3°85 "24493 57 6°50 "15044 218 
3°86 "24436 57 6°60 "14825 212 
3°87 *24379 56 6°70 *14613 206 
3°88 "24323 56 6°80 *14407 201 
3°89 "24267 56 6°90 *14206 195 
3°90 24211 56 7-00 *14010 190 
3°91 "24155 55 7-10 *13820 185 
3:92 *24100 55 7°20 13635 180 
3°93 "24045 55 7°30 °13455 176 
394 *23990 55 7:40 13279 171 
3°95 "23935 54 7:50 "13108 167 
3°96 *23881 55 7°60 *12941 163 
3°97 "23826 54 *12778 159 
3°98 23772 53 7°80 12619 155 
3°99 *23719 54 7°90 *12464 151 
4°00* 23665 264 8°00 *12313 148 
405 *23401 258 8:10 *12166 144 
4°10 *23143 253 8°20 *12021 141 
4185 *22890 248 8°30 “11880 138 
4°20 "22642 243 8°40 11743 135 
4°25 *22399 238 8°50 “11608 132 
4°30 22161 233 8°60 11477 129 
*21928 228 8°70 *11348 126 
4°40 *21700 224 8°80 *11222 123 
*°21476 219 8:90 ‘11099 120 
4:50 21257 215 9°00 10979 118 
21042 211 9°10 10861 115 
4°60 ‘20831 207 9°20 "10745 113 
465 *20624 203 9°30 10632 111 
470 20421 199 9°40 “10522 108 
ATS *20222 195 9°50 *10413 106 
4°80 *20027 192 9°60 *10307 104 
485 “19835 188 9°70 *10203 102 
4°90 *19647 185 9°80 ‘10101 100 
495 "19462 181 9°90 *10001 98 
5:00* *19281 353 10°00 09903 

5:10 "18928 341 

5:20 *18587 329 

5°30 *18258 318 

5°40 -17940 307 

5°50 17632 297 

5°60 *17335 288 

570 *17047 278 

5°80 “16769 270 

5:90 “16499 261 

* Note the change in the x interval. 
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ON THE CORRELATION OF THE MEAN AND THE 
VARIANCE IN SAMPLES DRAWN FROM AN 
“INFINITE” POPULATION 


By J. NEYMAN, Px.D., Warsaw. 


1. THE relation between the two characters in a sample, mean % and standard 
deviation o (or variance o*), is a very interesting one as several methods of judging 
whether a certain sample has been taken from a given population, or if two 
different samples have been taken from the same population, are based on the 
comparison of certain functions of % with certain functions of o. 


For the case where the original frequency distribution is normal, the question 
has been largely discussed, in particular in the paper of “Student” * in which the 


probability is found of having different values of -, where % is measured from the 


mean of the sampled population. We know that in the case of a normal popula- 
tion @ and o are independent, but we know further that the populations having a 
limited range which can be met with in practice are, speaking rigorously, never 
normal. In many cases, also, we are dealing with populations which differ from 
the normal ones in other characters besides the range. It is interesting therefore 
to consider the relation of % and o for such populations in order to get some idea 
about the possible differences in these properties. 


In dealing with this matter we may use the results of an earlier paper by the 
present author about skew regression published in this volume of Biometrika, and 
also the formula of K. Pearson giving the second order parabola of regression ft. 
After deducing a second order parabola of regression of c* on @ in a sample taken 
from an indefinitely large population with any given distribution, we shall show 
that the calculation of higher parabolae is of little value, as their expressions for 
small values of n—the number of individuals in a sample—are too complicated 
for us to draw practical conclusions from. Further the difference between them 
and the expression of the second order parabola vanishes as n -> 00. 


Afterwards we shall draw attention to a method of judging whether a sample 
is likely to have been taken from a population whose distribution is supposed to 
be knowa. 

2. We shall use a notation like that in the previous paper, namely: p, will 
denote the kth moment of the sampled population about its mean; 4,—the mean 
value of (a? — a) Z* and n—the number of individuals in the sample. 


* Biometrika, Vol. v1. pp. 1 et seq. 
+ Biometrika, Vol. x11. pp. 296—300. 
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It is convenient to measure the deviations of the two variates o? and % from 


4 their respective means in terms of their standard deviations, namely 

n n 

fi Adopting these units we shall put in our yt for the higher parabolae of 

regression, instead of h;, the expression 

o* ( ) 


The deviation of the mean in any sample from its mean value in samples we shall 
: denote by # and the ordinates of the parabola of regression measured from the 
mean of o? by u’; when #’ and w’ are measured in terms of their standard devia- 
tions as given in (1) we shall term them @ and wu. We shall also need the usual 
frequency constants 8 for the sampled population and for the population of means 
Be of samples taken from the former. The first we shall denote by 8,, 8, ... and the 
second by B,, By, .... 


The formula for the second order parabola of regression can be taken directly 


from the paper of K. Pearson (Joc. cit.), and applying it to the variates under con- 
sideration we have 


It remains to express all the constants q, and the B’s in terms of the f’s. All 


necessary formulae are contained in my paper in Biometrika, Vol. xvi. pp. 472— 
479. We have 


Vvn-1 VB, B 

B,-—3 (5) 

where p’, R’ and o are symbols previously used. 
Further B,= Bs , B= (6). 

n n 


Using these formulae, we have 


(7), 
and the square of the corresponding second order correlation ratio is 
H? n—1 (B, — — 3)? 

(8, — 

2n* (Bz — — 1) 8b 
{(n-1) +3} {2n + By— B, —3} ( 


4 

j 

| 

| 

| 
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From (8a) we see that for large values of n the second order term has no 
importance provided A, is not zero. If 8,=0 the first order term disappears and 
the correlation depends only on the remaining term, unless 8, = 8, at the Normal 


Point, when this term also vanishes. When n—> » , then H.2—> a4 , that is to say 

the contours in the A, B field of equal correlation between % and o? are approxi- 

mately straight lines radiating from the point 8,=1, 8,=0. Along the boundary 

line 8,=0 we have seen that H,?=0, while the form (8b) shows that along the 

other boundary of the field, namely, 8, — 8,—1=0, there is perfect relationship, or 

HA} =1. 

This latter equality arises from the well-known fact that the populations 
corresponding to §,, 8, points along this line contain only two classes of 
individuals; the same condition must hold in the sample, where let us say there 
are p individuals with character value a and n— p with character value b. Then 
the mean and the variance of the sample will be definite functions of p, namely 


(9), 
n n 


If now we express a and b in terms of 8, and 8,=£,+ 1, we shall be able to 
transform (10) into the regression parabola (7) corresponding to these values of 
the f’s. 

Turning to (7), it will be noticed that the sign of the coefficient of % depends 
upon the sign of «= 8,—8,—3. That is to say for samples from populations whose 
distributions correspond to §’s lying above* the line B,—8,-3=0, the two 
infinite branches of the regression parabola are directed in the negative sense of 
the u-axis, while for samples from populations with #’s lying on this line the 
second order term vanishes and (7) becomes linear; and for samples from popula- 
tions with §’s below this line, the branches of the parabola are directed in the 
positive sense. &,—8,—3=0 is the line along which lie the §’s of distributions 
representing Poisson’s limit to the binomial, and it also divides the areas of positive 
and negative binomials. 

Let us now consider the form of (7) a little more in detail. In general (that is 
to say, when the original frequency distribution is not a U- or J-shaped one) the 
most interesting points of the #-axis are those near the mean and the mode, and 
we shall consider the behaviour of the second order parabola of regression near 
these points. The abscissa of the mode of means in samples is not known, but we 
can take, as an approximation to it, the abscissa M(%) of the mode of a Pearson 
curve fitted to the distribution of means. It is given by 

M (2) = 8) (11). 
2 (5B, —6B,—9) n 2[6(n—1)+5B8,—6f, —9] 


* Above, in the sense of Fig. 2. 


| 
et: 
| 
| 
| 
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The sign of M(Z) depends upon the sign of VB, [6 (n—1)+ 58.—68,-9] The 
expression in brackets is generally positive. In fact, if it be negative, then 
58. —68,—9 must be negative, which happens when the original frequency 
distribution is a J-shaped or a U-shaped one and is unusual. Further, as 8, >, + 1, 
6(n—1)+ 5B, —€2,-92>6n —B,—10 
and if the left-hand side be negative then the right-hand one must be so and there- 
fore B,>6n—10, which for any value of n > 2 gives values of 8, very unlikely to be 
met with in practice. Therefore we shall consider only the case, when the denominator 
in (11) is positive and so that the sign of M (2) is that of —V8,. 
Let & be the abscissa of the vertex of the parabola (7). We find easily: 
By 3° 

If 8, — 8, —3< 0 then the signs of M (%) and & are different and thus the mean of 
the population is between the mode M(%) and the vertex of the parabola. We 
remember that in this case the infinite branches of the parabola are directed in the 
negative sense of the u-axis. 


Suppose now that 8,—8,-—3>0. Then the signs of & and M(#) are the same. 


Let us consider 
E An[6 (n—1) + 58, — 68, — 9] 


and find the condition that it be less than unity. 


This condition can be written 
12n? + 2n (5 — 68, 15) < 6n (By — 3) + 
and we see that it is satisfied if n lies between the two roots of the equation 
12n? + 2n (28, — 38, — 6) — (B,—8) (Bz — — 3) = 0, 
that is, (as n is positive), when 


(282 — — + 12 (82 3) (Be — 3) (282 38 6) 
12 


As 0 <8, < Bz —3, it follows that for &< M(%) it is necessary that 
5 (B,-3 12n +15 
In practice we meet very rarely a ®, with so high a value as 7°8 (which corre- 
sponds to the case n=2!) and therefore see that with rare exceptions the mode 
M (2) lies between the vertex and the mean of the population. Further, in the 


case we are considering (8, — 8, —3 >0) the infinite branches of the parabola are 
directed in the positive sense of the u-axis. 


We conclude therefore that for samples from populations likely to be met with 
in practice, that is to say when 


A, +10 and n> 


n> 


| 

| 
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the ordinate, u, of the regression parabola of the variance decreases as we pass 
from the mean value of % to its modal value in samples. Further if the size of the 
sample n is considerable, since the ratio £/M (#) is of order n, the variation of u 
in the neighbourhood of the modal and mean % will be approximately linear. 
The position can be illustrated diagrammatically as in Fig. 1, which corresponds to 
the case of a sample drawn from a population with positive skewness (yu, or VB, 
positive) and having 8, — 8, —3 < 0. 


O;) signifies 
a length of order Ve OF 


MODAL X 
MEAN X 


Fig. 1. Regression of Variance on Mean in Sample of n. 


It is in fact only in small samples that the regression curve diverges far from 
a straight line within the range of significant frequency, ie. within % = + 30;, 
and w= + 30,:, say. This is brought out clearly in the Tables on pp. 411—413, 
which give the values of H,=|r|, H,, 100 x ee for samples of 10, 50, 100 

2 

and 1000, and for population values of 8, and ®, covering the range commonly 
met with. Table IIT shows how the second order term becomes of slight importance 
for samples over 100, except for very small values of 8, and for extreme values of By. 
Tables I and II make it clear that we may be very far wrong when considering a 
sample from a population of unknown form, if we assume that the normal rule of 
no correlation between the mean and the standard deviation in the sample is 
applicable. 


Let us consider the locus of points on the 8,8, field corresponding to a given 
value of H,. We have from (86) __ 
By — 1)=(1 — — 1) n+ 3] [2n + B, — By —3]...(13), 
which is the equation of a hyperbola passing through the point 


2 n-3 


This hyperbola is independent of H,, and has the asymptotes 
(1 — H,2) [(n— 1) 2 +3] (14), 
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0 2 “4 6 8 1-0 12 1-4 
Straight Line — — —correspords to 2—- OO 
Hyperbola ” 2 =100 
” » 2 = 10 


40 


: 
5-0H 5-0 
: 
6-0 T +— I T T I I I I 6-0 
0 2 “6 8 1-0 1-2 1-4 
By 
Fig. 2. Diagram of 8;, 82 Field showing Contours of Constant Hy for Different Sizes of Samples. 
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If n> © then this hyperbola degenerates into a straight line 


(16), 
passing through the point B,=0, =1. 
Fig. 2 shows the 8,8, field with two sets of hyperbolae (13) corresponding 
to n=10 and n= 100 and to H, °4, ‘8 and 1:0, and with the set of straight 
lines (16) corresponding to the same values of H,. We see that the hyperbolae 


for n= 100 are close to the corresponding straight lines except for very small 
values of Hy. 


Considering this figure we see that although the very high values of H, fall 
on the whole in the area of J- and U-curves, this constant may yet have quite 
considerable values for populations of types commonly met with, and it is clear that 
one must be very cautious in drawing conclusions from Student’s test, if there be 
no good reason for supposing that the original frequency distribution is normal. 
We see further that this is not a case where the position is improved in large 
samples. 

The difficulties involved in the application of Student’s test may be illustrated 
also in the following way. Assume that we are dealing with a sample from a 
population with 8, ='27 and 8,=3:5*. Suppose further that the variance of the 
sample shows a deviation from its mean equal to 20,:; for the deviation of 
the mean in the sample, we can take two cases, first—that it is equal to + 20; 
and second—that it is equal to —20z. If we express the deviations of both the 


variates in common units, for instance in terms of o,= , Me 2 we shall have 
n 


1424/2 0438), B= 42, 


where o is the standard deviation of the sample, and % the deviation of the mean 
of the sample from the mean of the population. If now we do not know the 
moments of the sampled population but only o and the position of the sample 
mean, we might use Student’s method to test the probability of drawing such a 
sample from a population with mean at distance + 2 (or — 2) from the sample mean. 
This we should do by calculating in both cases the same ratio, viz. 

| 


z= in Student’s notation, where o = s, or in that of Fisher, 


If for instance n = 11, we find that z=:45 and ¢= 1°42. 


* These figures correspond to a set of 200 yields of a serial in a field experiment described by Gorski 
and Stefaniow in Rocznik Nauk Rolniczych, 1917. 

+ For tables of z, see Biometrika, Vol. v1. p. 19 (revised and corrected as Table XXV in Pearson’s 
Tables for Statisticians and Biometricians, 2nd Edition), and for t, Fisher’s Statistical Methods for 
Research Workers, 1925. 


| 
| 
| 
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Now let us consider the two cases from the point of view of the correlation 
between the variance and the mean. We shall see below that the difference 
= therefore we can assume that the 
actual regression line is well fitted by the second order parabola. If we calculate 
its ordinates corresponding to the two points %= + 2, we shall have 


u(—2)=— "564; (+2)= "688. 
Taking the ratio of the deviations of the actual value of o* in the sample, from 


these two means of different arrays, to the corresponding mean array-standard 
deviation, we have 


between H,?— H,’, where k > 2, is of the order 


2 
and we see at once that the probability of obtaining the two samples (% = — 2, 


+ 2) and (®=+2, from the same population is very different, 
although Student’s test gave us the same figure, t = 1-42, for both. 

The method described can be considered as a general method of testing 
whether a sample is likely to have been taken from a population with a given 
distribution, at least, with given and py. 

As we have seen, it consists in comparing the deviation —-—-————— with its 


standard deviation V1 — H,2, that is to say, in calculating the ratio 


Ao? — u(%) 


n V2 (2n + By — By —3) By— 1) 


where % is the deviation of the mean in the sample from the mean of the popula- 
tion measured in terms of af &, #2 being the variance of the population; Ago? is 


the deviation of the variance in the sample from its mean value 


n—1 
measured 


in terms of 


n is the size of the sample; 8, and f, refer to the sampled population. If L is 
nearly 3, we may consider it as improbable that the sample has been taken from 
the population considered; if however Z < 2 such an assumption will not be in 
disagreement with the observed mean and standard deviation of the sample. 
Unfortunately such cases in which we know the constants w., 8, and P, of the 
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sampled population are very seldom met with. When as usual they are un- 
known the results of this paper serve to emphasize the caution which it is 
necessary to exercise in drawing conclusions from the contents of small samples*. 


3. We shall now prove thatthe difference H,2 — H,?, where k > 2, is at least of 


the order *.. We shall do it by showing that any term V;,2/A,_,A; of the 


series (35)+ is of the order = As the calculations necessary for this purpose are 


easier to do than to describe, we shall avoid the details and simply sketch the 
general method of procedure. 


We have 
qo n> Uk 
| Mea, Mp, | 
| Me, M, 
M,, M, Miss 


M;, eee My 
where the M’s are the moments of the mean of the sample about its mean, measured 
in terms of o3. 


Let us calculate some first terms of M’s and q’s which will be clearly 
polynomials in =. For this purpose let X be the deviation of the mean in the 
sample from its mean measured in ordinary units and [A] be the mean value of any 


* To avoid possible misunderstanding we may here add the following remarks: 

Let us consider the variate v,=07 u, (Z), 
where w, (%) is the ordinate of the kth order parabola of regression. The variate v, is a character of 
the sample in the sense that to every sample there corresponds a perfectly definite value of v;,. There- 
fore we may calculate the mean 2, of v, and its standard deviation ¢, . We have, rigorously, 

remembering that o? is measured in ‘terms of its own standard deviation. 

The proposed method of testing is therefore constructed on the usual principle of comparing the 
deviation of a variate with the corresponding standard deviation. We notice that if k increases, 
oy, decreases and thus our test becomes more and more accurate. 

Of course it would be useful if we knew the value of the standard deviation of v; for each array 
corresponding to the different values of Z, because very probably they are not constant. Then it would 
be possible to consider the variates v;, (Z) corresponding to different values of %. In any case the pro- 
posed L-test seems to be as justified as any other test built on the same principle. 

+ In my previous paper. 
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variate A. Then, if a, a, ... #, are the deviations of individuals in the sample from 
the mean of the population, 


n k 
> 
k! 


where the sum is extended over all combinations of non-negative integral values 
of r’s satisfying the condition 
Tet. t+ 
Accordingly 
1 


This sum contains several classes of equal terms corresponding each to a single 
system of values of the r’s written in different orders, the number of terms being 
dependent upon n. We notice that ~,=0 and we assume yy»=1. The most 
numerous class of non-vanishing terms of (20) corresponds to the following 


systems of r’s: 
k 
m=2 (i=1,2...5) 


2 


1,...») 


if k be even, 


and 


if k be odd. 


Tn the first case let k = 2s and in the second k= 2s+1. We have 


(2s)! mn! pos? 
si (n—s)! (higher terms in (21), 


= 


(2s + 1)! nm! ( 
[x2] = +... higher terms in 


Proceeding in this way and dividing finally the terms of [X*] by (BY and the 
2s+1 


terms of [X**1] by (“) we find 


n 


(2s)! s(s—1)/B, 1 1 


3!(s—1)!2Vn 1028; 3! (31° 


4 

| 

a 2 

r;=0 (i +1 n 

J 

| 
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In the same way, 


Vn (2s)! (2s)! 1 


= 


nv By (2s +1)! 8 [Bs 3(s—1) 
—1)(s—2)R, 9s°+11s+6] 1 


Putting these values into A;, we find that in its expansion the terms independent 
of n do not vanish and so that if n > 2 we have 


As to the expansion of V;, (where i: >3), not only do the terms independent of n 


: 

vanish, but the terms with the lowest power of = , that is with —, vanish also. 

The next power, or 2 in this expansion is the first one. Thus V; has a common 


1 
factor = and so the kth term of the expansion (35) of my previous paper, for k > 3, 


1 
has the common factor —. If n be large this can be considered as showing that 


the second order parabola gives a good fit to the regression line of the variance on 
the mean of the sample. 


In conclusion I wish to express my warm thanks to Dr E. S. Pearson for his 
help in the English redaction of this paper and for the suggestion to add to it 
tables and diagrams, the latter of which have been skilfully executed by Miss Ida 
McLearn of the Galton Laboratory. 


TABLE I. Values of H,= 
n=10 | n=50 
Bo 6. 0 1 | 5 | 1°0 | 0 1 5 | 1°0 }* Bs 
15 0000 3721 | ‘8321 | — 0000 4300 “9615 1°5 
20 | -0000 | 2860 | -6396  -9045 | -0000 | -3100 | | -9802 2-0 
25 | -o000 | | 5388 | -7620 | -0000 | -2548 | -5697 | -8056 
3-0 | | ‘2121 | +4744 0000 | -2214 | -4950 | 7000 | 3-0 
35 | 0000 | -1917 4986 “6061 | ‘0000 | | | 6274 | 3-5 
| 0000 | 1762 | +3939 | -5571 | ‘0000 | +1813 | -4055 | +5735 | 4-0 
45 | 0000 | -1639 | -3665 | -5183 | 0000 | -1680 | +3758 | -s314 | 4-5 


N.B. Figures above a continuous line correspond to the U-shaped, and figures between a 
continuous and a dotted line to the J-shaped curves of Pearson. 
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TABLE I (continued). 


n=100 n=1000 
p, 0 1 1°0 1 | 1°0 
| 
0000 | | 9804 0000 | 4463 | 9980 15 | 
2-0 ‘0000 | *7001 9900 3159 | -7064 “9990 20 | 
25 0000 | +2565 | 5735 8111 0000 +2580 ‘5770 8160 25 
| 
3°0) 0000 | +2225 | ‘4975 “7036 “0000 +2235 *4997 7068 | 3:0 
| $°5 0000 | ‘1992 | 6299 1999 6322 | 3:5 
| 0000 | ‘4069 5754 | +0000 1825 | -4081 5772 40 
| 
‘0000 «+1685 3769 ‘5330 0000 | 1690 ‘3779 5344 
TABLE II. Values of 
n=10 n=50 | 
| 
1°5 4103 5754 | 1:0000 “2055 ‘4827 | 1:0000 15 | 
‘2075 ‘3663 7132 | 1-0000 0985 “3284 7088 “0000 2-0 | 
25 ‘0863 2624 ‘5665 8070 0404 2593 5754 8148 25 | 
30) ‘0000 ‘2127 4804 6882 ‘0000 2215 4962 7035 30 
‘0669 1990 4286 6100 | -0313 2000 6281 | 
“1216 ‘2075 ‘3987 5571 0579 “1885 4065 5735 40 | 
‘1677 “3836 5215 ‘0791 1836 3795 5321 | 
n=100 | n=1000 
Bo “0 5 1:0 0 1:0 Bo 
| 
1°65 ‘1476 -4659 | 1:0000 = 0474 4492 | 1:0000 1°5 
20) 0702 3235 7150 | 10000 | ‘0224 3169 “7072 20 
25 ‘0287 2588 5764 8156 | ‘0091 2583 ‘5773 8164 25 
3-0 | -o000 | -2295 | -4981 | -7053 | ‘0000 | 2235 | -4998 | -7o69 | 3:0 
0244 “2000 "4454 6303 0071 +2000 “4470 “6322 35 
40 0406 1856 “4074 ‘5754 0129 *1829 -4082 5772 40 
0563 1766 ‘3787 5333 0179 ‘3780 5344 ys 


N.B. Figures above a continuous line correspond to the U-shaped, and figures between a 
continuous and a dotted line to the J-shaped curves of Pearson, 
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TABLE ITI. 
H. H. 2 
Values of 100 x 
of 
n=10 n=50 
| 
Bo aif “0 5 1°0 | ‘0 5 1°0 
on 
15 | 100-0 30:8 | 100-0 20°6 15 
| 
20 | 100-0 39-0 19°6 18-2 | 1000 10°9 4°4 3-9 20 
25 | 100-0 15°7 95 | 108 100-0 3°5 20 | 22 25 
3-0 5 5-0 1 ‘5 10 | 30 
35 | 100-0 7:3 0 13 | 1000 16 0 «| 
4:0 | 100-0 27-9 2-4 ‘0 | 1000 7-4 ‘0 40 
| 45 | 100-0 47°8 8-7 100°0 1:9 2 
n=100 n= 1000 
a, 0 1 5 1°0 | ‘0 5 \ig, 
1-5 | 100-0 11°4 39 — | 1000 13 4 15 
| 1000 5°7 2-2 2-0 100-0 6 2 20 
25 | 1000 18 | 100-0 
3-0 0 2 0 3-0 
100°0 ‘0 100°0 ‘0 0 ‘0 
4°0 100°0 3°9 2 | 100-0 “4 0 40 
45 | 100-0 8-9 10 | 100-0 1:0 0 
| 


N.B. Figures above a continuous line correspond to the U-shaped, and figures between a 
continuous and a dotted line to the J-shaped curves of Pearson. 


* The value of 100 (H,?- H,*)/H,? at 8,=0, 8,=3 depends on the manner in which we approach 
this point. If we take m=(8, —3)/8,, then we have 
2n 
100 (H,? H,*)/H, (e+ i 7) 100 if m>o, 
i.e. if we approach along a curve tangential at 8,=3 to 8,=0; but if m be finite, i.e. if we approach 
along any curve passing through this point (8,=0, 8.=3) but not tangential to 8,=0, then 


100 (H,? - H,2)/H.2=0. 


MISCELLANEA. 


The Use of Biometric Methods applied to Craniology, being a critique of 
Professor Gordon Harrower’s ‘‘A Study ofthe Hokien and the Tamil 
Skull.” Trans. R. S. Edinburgh, Vol. Liv. Part 3, No. 13. (Issued July 6, 
1926.) 


By G. M. MORANT, D.Sc. 


THE craniometric methods which are used to-day by workers in the Biometric Laboratory are, 
with some additions and slight modifications, those which were devised in the early days of 
Biometry, and since Miss Fawcett’s paper in the first volume of Biometrika several thousand 
skulls have been measured in precisely the same way. Those methods were drawn up by 
Professor Karl Pearson with the object of providing data which would be comparable with 
the measurements of both French and German anthropologists—a consideration almost entirely 
neglected in earlier British craniometric work—and the fact that many different workers have 
employed them is a sure guarantee of their practicability. A great advance in descriptive 
technique was made when Crewdson-Benington published the first type contours of a racial 
series of skulls and Professor Pearson’s more recent coefficient of racial likeness is proving an 
indispensable aid to the statistical student. But the work involved in providing such complete 
descriptions and comparisons of adequately long series of crania is very great, and the 
accumulation of that sufficient bulk of evidence, which is required before statistically reliable 
conclusions can be drawn, proceeds at a rate which is painfully slow from the individual’s point 
of view. It is with a peculiar pleasure, therefore, that we welcome the publication of a paper 
discussing a collection of skulls by aid of the methods we are ourselves using. This study of 
Dr Harrower’s is, we believe, the first not issued from the Biometric Laboratory which gives 
type contours and coefficients of racial likeness and which adopts the measurements used there. 
The skulls dealt with are those of unclaimed coolies collected in Singapore immediately after 
death. There are 36 ¢ Chinese from the southern province of Foo-kien and 35 ¢ Tamils from 
the south of India and these samples, though smail, should provide good first approximations to 
the racial types. In many ways which might seem unimportant to anyone not well acquainted 
with the technique of measurement Professor Harrower has unfortunately deviated from 
biometric usage. The usual index letters denoting measurements were adopted but with certain 
alterations and one or two unusual definitions are particularly likely to lead to confusion. The 
writer says, ‘“‘I have followed the greater number of workers in making the measurement (the 
nasal height) from nasion to the most prominent point on the nasal spine,” but we do not know 
any other modern worker who follows that practice. Such a measurement is certainly not 
comparable with any taken in the Biometric Laboratory, or with Broca’s to the base of the spine 
or with the Frankfurt height, and it is not to be found in Martin’s detailed list. The capacities 
found by shot are not comparable with those given by Macdonell’s seed and water method. But 
a more serious discordance is found in the definitions of orbital diameters. It has been the 
custom of all craniometricians to take an orbital width along the major axis of the roughly 
elliptical cavity, though different investigators have employed different terminal points on the inner 
margin, and the orbital height perpendicular to this width. But Professor Harrower defines the 
height he has taken to be the greatest in a vertical plane—presumably perpendicular to the 
Frankfurt horizontal—and then the breadth is taken from the dacryon at right angles to the 
height. The latter is certainly not the same as Broca’s dacryal breadth and, as far as we are 
aware, both this height and breadth have not been previously employed in measuring a cranial 
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series. In protesting against what he calls “non-essential measurements” the writer of this 
paper remarks that, “the actual difference between the basi-apical (our H) and basi-bregmatic 
(our H’) heights is infinitesimal compared to the range of variation of either measurement in 
a homogeneous series,” and hence he only measures the latter and would apparently consider it 
comparable with the “basi-apical” height given by other workers. Such a statement displays 
an entire lack of appreciation of the effort that has been made to make these comparisons 
rigorously legitimate. The kind of fallacy to which it opens the door may be better illustrated 
in the case of orbital measurements. The question is, of course, whether the mean differences 
due to the use of different methods of measurement are infinitesimally small or not compared 
with the probable errors of the means. There are two orbital widths commonly taken in this 
laboratory, Broca’s from the dacryon (our 0,’) and the other from the pencil line completing the 
inner orbital margin (our 0,)*. The second always exceeds the first and the ¢ mean differences 
for series on which both have been taken are: Tibetan A 24 (37), Burmese A 4°6 (41), 1st Dynasty 
Egyptians 2°8 (33), Farringdon Street English 2°5 (81) and for the last series the standard devia- 
tion of O; is 1°56 + 08 and of O,' 1°67 + 09. It would obviously be absurd to suppose that the 
mean orbital measurements found by the two different methods are directly comparable. Con- 
siderations such as these have led to the conclusion that the measurements of different workers 
may never be compared unless they were taken in precisely the same way: the differences due to 
the use of different methods of measurement are by no means infinitesimally small compared with 
the probable errors of the means, the inter-racial differences and the inter- and intra-racial standard 
deviations. Hence we are obliged to duplicate measurements in order to provide comparable 
data. Wecan only suppose that conclusions drawn from the direct comparison of measurements 
that are not identical, as in this paper, are, in all probability, entirely fallacious. In Table II the 
means of the series of Chinese and Tamil skulls are compared with those of 14 other races and there 
is no indication of the papers in which the quoted measurements were first published. That 
omission is all the more to be regretted as the numbers of skulls on which the means were based 
are not given and the student who wishes to use them in computing coefficients of racial like- 
ness, or for other statistical purposes, must refer to the original studies. The means given for 
Turner’s Scottish skulls certainly represent a heterogeneous population. The Hokien facial 
index, wrongly given as 76°5, should be 73°8 ; the Tamil transverse arc, wrongly given as 321°6, 
should be 313-2 and the Tamil occipital index, wrongly given as 63°95, should be 62°2. In 
computing coefficients of racial likeness Professor Harrower used the standard deviations of the 
lony series of 26th—30th Dynasty Egyptian skulls measured and preserved in the Biometric 
Laboratory but no acknowledgement is made of that fact, and it would not be clear to anyone 
not well acquainted with the craniometric work done there that the greater number of the 
coefficients given are qroted from the pages of Biometrika. The sagittal and transverse type 
contours—the horizontal types not being provided—were prepared in the customary way with 
slight modifications. On the former the point of maximum subtense from the nasion-inion line is 
given instead of the point of maximum subtense from the glabella-inion line and on the latter 
the auricular points were located further inside the passages than is usual. 

The only comparative Asiatic material used in this paper is taken from the papers of 
Tildesley t—misspelt Tyldesley throughout—and Morantt. Professor Harrower was apparently 
not acquainted with a later biometric study of the Nepalese skull in which reduced mean skull 
measurements for a number of other Oriental races are given§. He says (p. 573): “China 
covers a vast expanse, much larger than most people realise, and its inhabitants include a large 
variety of branches which vary widely in craniological features....The dissimilarity between the 
Cantonese Chinese and the Hylam Chinese skull is, I believe, as great as that between the 


* See Biometrika, Vol. 1. p. 430 and Vol. vin. pp. 311—312. 
M. L. Tildesley, ‘‘ A First Study of the Burmese Skull,” Ibid. Vol. xur. 1921, pp. 176—262. 
G. M. Morant, ‘‘ A First Study of the Tibetan Skull,” Ibid. Vol. x1v. 1923, pp. 193—260. 
§ G. M. Morant, ‘‘ A Study of certain Oriental Series of Crania including the Nepalese and Tibetan 
Series in the British Museum (Natural History),” Ibid. Vol. xv1. 1924, pp. 1—104. 
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Scottish and the German skull.” But no comparisons with earlier studied series of Chinese skulls 
and, indeed, no evidence of any kind are brought forward in support of these statements. In the 
last-mentioned paper the mean measurements of three fairly long series of Chinese skulls from 
different parts of the country are given. Though not identical, they proved so similar that the 
conclusion that all could be considered local varieties of a single type seemed to be quite 
justified. It was surprising to find that samples drawn from such an enormous and widely 
scattered population should indicate racial homogeneity and the comparison with the Hokien 
series is of particular interest as it may confirm or refute Professor Harrower’s statement. The 
coefficients of racial likeness are given in the table below and it may be remarked that the lowest 
found by Professor Harrower between the Hokien Chinese and an Asiatic race was 5°57 + ‘17 
(corrected probable error) with the Tibetans of the A type though that is rather higher than it 
should be owing to the fallacious comparison of orbital measurements. 


Coefficients of Racial Likeness between Series of Chinese Male Skulls. 


| Northern Chinese | Hokien Northern Southern 
| Koganei (69-4)* Chinese (36-0) Chinese (3674) Chinese (52°5) 
Northern Chinese All Characters 1°54+°21 2°56 +°19 6715+°19 
Koganei (69°4) Indices and Angles | — | 0°35 + °61 1°58 +°32 5°45 + 32 
| | 
Hokien | All Characters 154+ 21 1°62 +°17 
Chinese (36-0) Indices and Angles | 0°35 +61 _ 2°10 + 26 1:24 +29 
Northern All Characters | 56+°19 1°62+°17 059 +°17 
Chinese (36:4) | Indices and Angles | 1°58 + 32 2:10 + 26 - 1:00 + 
| 
All Characters 615419 | 1764-18 0°59+°17 
Chinese (52°5) | Indices and Angles 5°45 + | 1°24+°29 1:00 + 


* The numbers in brackets are the mean numbers of skulls available for the characters used in 
computing the coefficients. 


All the coefficients except those between Koganei’s Northern Chinese and the Southern 
Chinese are of the order of those found between local varieties of the same race or between very 
closely allied races, so comparison with the Hokien material enables us to re-affirm the belief 
that the immense population of China belongs predominantly to a single racial type. Making 
the kind of comparison Professor Harrower suggests between a British (Farringdon Street 
English (95°6)) and a German (Ranke’s Altbayerisch (77:0)) ¢ series we find coefficients of 
43°92 + ‘19 for 23 characters and 58°97 + ‘33 for 7 indices and angles, results of a widely different 
order. The longest series of Tamil skulls previously measured appears to have been a short one 
of 13 ¢ specimens co!lected by the brothers Sarasin*. The coefficients between the Tamils from 
Ceylon (12°4) and Harrower’s series collected in Singapore (35:0) are 2°25 + °19 for 21 characters 
and 3°54 + ‘32 for 8 indices and angles, but it would be unwise to conclude that there is a real 
difference between the island and mainland peoples since the samples are so small. The latter 
series gives coefficients of 5°66 +°18 with Veddah meanst (24:4) of 27 characters and 4:22 + ‘30 


* Paul and Fritz Sarasin, Ergebnisse naturwissenschaftliche Forschungen auf Ceylon. Wiesbaden, 
1892-4. 

¢ Given in Biometrika, Vol. xvi. p. 48 and reduced from the measurements of P. and F. Sarasin 
(op. cit.) with the addition of a few others. 
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when 9 indices and angles are compared. With the reduced Maravar (Dravidian) means* (25°9) 
the Tamils (35°0) give coefficients of 5°82 + -23 for 16 characters and 2°24 + °67 for 3(!) indices. 
As these are the lowest coefficients that can be found, little can be said about the affinities of the 
Tamil skull and no craniological classification of the Dravidian peoples of India can be attempted 
while the material is so meagre. 

Commenting on the use of the coefficient of racial likeness, Professor Harrower says: 
“a, skull may be an exact replica of another, but, say, 10 per cent. smaller in every measurement. 
Thec.R.L. between the two skulls would therefore be much greater than the c.R.L. between two skulls 
which had half the measurements exactly equal, one-fourth 10 per cent. above and the remaining 
fourth 10 per cent. below: skulls which would be entirely different in general outline” (p. 586). 
We may remark, in the first place, that no one has ever suggested finding coefficients between 
single skulls and that if conditions similar to these hypothetical ones were found for mean types— 
and from what little we know of inter-racial correlations the probability of finding such would 
seem to be very small—then it is by no means evident why the first coefficient should be much 
greater than the second. Indeed, the first might well be smaller than the second. The separate 
computation of coefficients for indices and angles only was, of course, suggested to provide 
a measure of differences of shape apart from differences of size. One or two minor points may 
be noticed. It is contrary to biometric usage to give standard deviations without probable errors 
as on p. 581. The frontal index defined on the previous page should be preceded by 100 and the 
legitimacy of its use may be questioned as the median sagittal are from nasion to bregma has not 
the form of a catenary if the superciliary ridge is at all well developed. We fail to find any 
evidence adduced to support the statement—made on p. 597—that “the Tibetan A group 
is hybrid and the Hokien is a pure type,” and it is unworthy of a scientific study to draw 
conclusions such as the following, involving theories which no one has ever attempted to 
demonstrate. ‘“ Intellectually,” we are told, “as judged by his cranial capacity, he (the Hokien) 
is capable of attaining to the highest standard, but the very narrow frontal region (giving the 
lowest value of all the groups compared) indicates that his powers of initiative and intelligence 
are relatively low” (p. 597). We are sorry to see such undisciplined theorising cheek by jowl 
with valuable biometric measurement. Anthropologists are indebted to Professor Harrower, 
however, for having provided a craniological record which will be of some permanent value. 


* Biometrika, Vol. xvi. p. 28. 
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Pulmonary Tuberculosis : the Mortality of 
the Tuberculous: Sanatorium and Tuber- 
culin Treatment. By W. ELpErRTOoN, 
F.LA., and J. Perry, 
Price 4s, net. 

IX. A Statistical Study of Oral Tem- 
peratures in School Children with special 
reference to Parental, Environmental and 
Class Differences. By M. H. Witttams, 
M.B., Bent, M.A., and Karu 
Pearson, F.R.S. Price 6s. net. 

X. Study of the Data provided by a 
Baby-Clinic in a large Manufacturing Town. 
By Mary Nort Karn and Kari Prarson. 
Price 15s. net. 

XI. Blood Pressure in Early Life. A 
Statistical Study. By Percy Stocks, M.D., 
D.P.H., assisted by M. Norn Karn. Price 
12s. net. 


Technical Series. 


On a Theory of the Stresses in Crane 
and Coupling Hooks with Experimental 
Comparison with Existing Theory. By 
E. S. Anprews, B.Sc. Eng., assisted by 
Karu Pearson, F.R.S. Isswed. Price 4s. net. 

On some Disregarded Points in the 
Stability of Masonry Dams. By L. W. 
ATCHERLEY, assisted by Kart PEaARsON, 
F.R.S. Issued. Price 7s. net. Sold only 
with complete sets. 

On the Graphics of Metal Arches 
with special reference to the Relative 
Strength of Two-pivoted, Three-pivoted 
and Built-in Metal Arches. By L. W. 


II. 


ITI. 


ATCHERLEY and Kari F.R.S. 
Issued. Price 5s. net. 

On Torsional Vibrations in Axles 
and Shafting. By Karu Pearson, F.R.S. 
Tssued. Price 6s. net. 


IV. 


V. An Experimental Study of the 
Stresses in Masonry Dams. By Karu 
Pearson, F.R.S., and A, F. CAMPBELL 
PoLLARD, assisted by C. W. WHEEN, B.Sc. 
Eng., and L. F. Ricnarpson, B.A. Issued. 
Price 7s. net. 


VI. Ona Practical Theory of Elliptic and 
Pseudo-elliptic Arches, with special refer- 
ence to the ideal Masonry Arch. By Karu 
Pearson, F.R.S., W. D. Reynotps, B.Sc. 
Eng., and W. F. Sranton, B.Sc. Eng. 
Issued. Price 6s. net. 


VII. On the Torsion resulting from Flexure 
in Prisms with Cross-sections of Uni-axial 
Symmetry only. By A. W. Youne, Eraen 
M. Experton and Kart Prarson, F.R.S. 
Issued. Price 7s, 6d 
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II. 


ITI. 


IV. 


VI. 


II. 


ITI. 


VI. 


VII. 


VIil. 


Ix. 


Drapers’ Company Research Memoirs—(cont.). 
Questions of the Day and of the Fray. 
The Influence of Parental Alcoholism | VII. MendelismandtheProblemof Mental 


on the Physique and Ability of the Off- 
spring. A Reply to the Cambridge Econo- 
mists. By Kart Pearson. Price 1s. 6d. net. 
Mental Defect, Mal-Nutrition, and 


Defect. I. A Criticism of Recent American 
Work. By Davin Heron, D.Sc. (Double 
Number.) Price 2s. net. 


| VIIL. MendelismandtheProblemofMental 


the Teacher’s Appreciation of Intelligence. | 


A Reply to Criticisms of the Memoir on 


‘The Influence of Defective Physique and | 


Unfavourable Home Environment on the 
Intelligence of School Children.’ By Davip 
Heron, D.Se. Price 1s. 6d. net. 


An Attempt to correct the Misstate- | 


ments made by Sir Vicror Horstey, F.R.S., 
F.R.C.S., and Mary D. Sturer, M.D., in 
their Criticisms of the Memoir: ‘A First 
Study of the Influence of Parental Alcohol- 
ism,’ &c. By Kart Pearson. Price 1s, 6d. net. 


The Fight against Tuberculosis and | 


the Death-rate from Phthisis. By Kari 
Pearson, F.R.S. [Out of print 
Social Problems: Their Treatment, 


Past, Present and Future. By Kart | 


Pearson, F.R.S. Price 1s. 6d. net. 
Eugenics and Public Health. Lecture to 

the York Congress of the Royal Sanitary In- 

stitute. By Kart Pearson. Price 1s. 6d. net. 


Defect. II. The Continuity of Mental 
Defect. By Kart Prarson, F.R.S., and 
Gustav A. JAEDERHOLM, Ph.D. Price 
ls. 6d. net. 


IX. Mendelismand the Problemof Mental 


Defect. ILI. On the Graduated Character 
of Mental Defect, and on the need for 
standardizing Judgments as to the Grade 
of Social Inefficiency which shall involve 
Segregation. By Kart Pearson, F.R.S. 
(Double Number.) Price 2s. net. 


X. The Science of Man. Its Needs and its 


Prospects. By Kart Pearson, F.R.S. Being 
the Presidential Address to Section H of 
the British Association, 1920. Price 1s. 6d. 
net. 


Xl. Francis Galton, A Centenary Appre- 


ciation. With portrait sketch. By Karn 
Prarson, F.R.S. Price 2s. net. 


XII. Charles Darwin, 1809-1882. An Ap- 


preciation. With an unpublished portrait. 
By Karu Pearson, F.R.S. Price 2s. 6d. net. 


Eugenics Laboratory Publications 


MEMOIR SERIES. 


The Inheritance of Ability. By Encar 
Scuuster, D.Sc., Formerly Galton Research 
Fellow, and M. Galton 
Scholar. [Out of Print 

A First Study of the Statistics of 
Insanity and the Inheritance of the Insane 
Diathesis. By Davip Heron, D.Sc., Form- 
erly Galton Research Fellow. [Out of Print 

The Promise of Youth and the 


Performance of Manhood. By Epaar | 


Scuuster, D.Sc., Formerly Galton Research 


Fellow. [Out of Print | 


On the Measure of the Resemblance 
of First Cousins. By Ernen M. 


Galton Research Fellow, assisted by Kart | 


Pearson, F.R.S. Price 5s. net. 


A First Study of the Inheritance of | 
Vision and of the Relative Influence of | 


Heredity and Environment on Sight. By 
Amy Barrineton and Kart PErarson, 
F.R.S. Price 5s. net. 

Treasury of Human Inheritance. 
Parts I and 11 (double part). (Diabetes in- 
sipidus, Split-Foot, Polydactylism, Brachy- 
dactylism, Tuberculosis, Deaf-Mutism, and 
Legal Ability.) Price 14s. net. 

On the Relationship of Condition of 
the Teeth in Children to Factors of Health 
and Home Environment. By E. C. Ruopgs, 
B.A. Price 9s. net. 

The Influence of Unfavourable Home 
Environment and Defective Physique on 
the Intelligence of School Children. By 
Davip Heron, M.A., D.Sc., Formerly Galton 
Research Fellow. Out of print 

The Treasury of Human Inheritance. 
Part III. (Angioneurotic Oedema, Herma- 
phroditism, Deaf-Mutism, Insanity, Com- 
mercial Ability.) Price 10s. net. 

The Influence of Parental Alcoholism 
on the Physique and Intelligence of the 


Offspring. By M. as- | 


sisted by Kart Pearson. Second Edition. 
Price 68. net. 


| 
| 
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XI. 


XII. 


XIII. 


XV. 


XVI. 


XVII. 


The Treasury of Human Inheritance. 
Part 1V. (Cleft Palate, Hare-Lip, Deaf- 
Mutism, and Congenital Cataract.) Price 
10s. net. 

The Treasury of Human Inheritance. 
Parts Vand VI. (Haemophilia.) Price 15s. 
net. 

A Second Study of the Influence of 
Parental Alcoholism on the Physique and 
Intelligence of the Offspring. By Karn 
Prarson, F.R.S., and M, 
Price 6s. net. 

A Preliminary Study of Extreme 
Alcoholism in Adults. By Amy Barrine- 
Ton and Karu Prarson, F.R.S., assisted 
by Davip Heron, D.Sc. Price 6s. net. 

The Treasury of Human Inheritance. 
Dwartfism, with 49 Plates of Illustrations 
and 8 Plates of Pedigrees. Price 15s. net. 

The Treasury of Human Inheritance. 
Prefatory matter and indices to Vol. I. 
With Frontispiece Portraits of Sir Francis 
Galton and Ancestry. Price 5s. net. 

A Second Study of Extreme Alco- 
holism in Adults. With special reference 
to the Home-Office Inebriate Reformatory 
data. By Davin Heron, D.Sc. Price 7s. 6d. 


net. 
XVIII. On the Correlation of Fertility with 


Social Value. A Cooperative Study. 
Price 7s. 6d. net. 


xXIX—X=X. Report on the English Birthrate. 


Part I. England, North of the Humber. 
By Eruet M. Galton Research 
Fellow. Price 9s. net. 


XXI. The Treasury of Human Inheritance. 


Vol. II (Nettleship Memorial Volume). 
Heredity of Anomalies and Diseases of the 
Eye. Part l. Price 45s. net. 


XXII. TheTreasury of Human Inheritance. 


Vol. III. Part I. Hereditary Disorders of 
Bone Development. Price 45s. net. 


XXIII. TheTreasury of Human Inheritance. 


ii) 


Vol. II (Nettleship Memorial Volume). 
Part II. Colour Blindness. Price 45s. net. 
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Eugenics Laboratory Publications—(cont.). 


Vol. I of The Treasury of Human Inheritance. 


Buckram cases for binding can be purchased at 5s. with 


impress of the bust of Sir Francis Galton. An engraved portrait of Sir Francis Galton can be obtained by sending 
a postal order for 3s. 6d. to the Secretary to the Laboratory, University College, London, W.C 


Noteworthy Families. By Francis Garon, F.R.S. and Epgar Scuusrer. 
Life History Album. By Francis Gatton, F.R.S. Second Edition. 


(Reissue.) Price 9s. net. 
(New Issue.) Price 12s. net. 


LECTURE SERIES. 


a The Scope and Importance to the 
State of the Science of National Eugenics. 
By Kart Pearson, F.R.S. Third Edition. 
Price 1s. 6d. net. 


II. TheGroundwork of Eugenics. By Karu 
Prarson, F.R.S. Second Edition. Price 
ls. 6d. net. 


III. The Relative Strength of Nurture and 
Nature. Much enlarged Second Edition. 
Part I. The Relaiive Strength of Nurture 
and Nature. (Second Edition revised.) By 
Eruet M. Exprerton. Part lI. Some Recent 
Misinterpretations of the Problem of Nurture 
and Nature. (First Issue.) By Karu PEar- 
son, F.R.S. Price 2s. 6d. net. 

IV. . On the Marriage of First Cousins. By 
Eruet M. Experron. Price 1s. 6d. net. 


V. The Problem of Practical Eugenics. 
By Kart Pearson, F.R.S. Second Edition. 
Price 1s. 6d. net. 


VI. Nature and Nurture, the Problem of 
the Future. By Kart Pearson, F.R.S. 
Second Edition. Price 1s. 6d. net. 


VII. The Academic Aspect of the Science 
of National Eugenics. By Kart Prarson, 
F.R.S. Price 1s. 6d. net. 

VIII. Tuberculosis, Heredity and Environ- 


ment. By Karn Pearson, F.R.S. Price 
ls. 6d. net. 

IX. Darwinism, Medical Progress and Eu- 
genics. The Cavendish Lecture, 1912. By 
Kart Pearson, F.R.S. Price 1s. 6d. net. 

xX. The Handicapping of the First-born. 

By Karu Pearson, F.R.S. Price 2s. 6d. net. 

XI. National Life from the Standpoint of 
Science. (Third Issue.) By Kart PEarson, 
F.R.S. Price 1s. 6d. net. 


XII. The Function of Science in the Modern 
State. (New Issue.) By Karn Pearson, 
F.R.S. Price 2s. net. 

XIII. Sidelights on the Evolution of Man. 


By Kart Pearson, F.R.S. Price 3s. net. 


Walter Raphael Weldon. 1860—1906. A Memoir. By Kart Prarson, F.R.S. Copies may be obtained, 
Price 6s. net, from the Secretary, Biometric Laboratory (6d. postage). 


A few copies of the following are still available: 


The Skull and Portraits of Sir Thomas Browne. By Miriam Tinpestey. Price Three Guineas. 
The Skull and Portraits of King Robert the Bruce. By Kart Pearson. Price One Guinea. 


The Skull and Portraits of George Buchanan. 


3y Kart Pearson. Price One Guinea. 


Application should be made to the Secretary, Biometric Laboratory, University College, London, W.C. 1. 


At the Cambridge University Press, Fetter Lane, E.C. 4. 


The Chances of Death and other Studies in Evolution 
By KARL PEARSON, F.R.S. Reissue. Price 80/- net. 


Vou. I 
1. The Chances of Death. 2. The Scientific 
Aspect of Monte Carlo Roulette. 3. Reproduc- 
tive Selection. 4. Socialism and Natural Selec- 
tion. 5. Politics and Science. 6. Reaction. 
7. Woman and Labour. 8. Variation in Man 
and Woman. 


II 

9. Woman as Witch. Evidences of Mother- 
Right in the Customs of Mediaeval Witchcraft. 
10. Ashiepattle, or Hans seeks his Luck. 11. 
Kindred Group Marriage. Part I. Mother Age 
Civilisation. Part II. General Words for Sex 
and Kinship. Part III. Special Words for Sex 
and Relationship. 12. The German Passion 
Play: A Study in the Evolution of Western 
Christianity. 


Mounted Charts of the Weight and Health 
of Male and Female Babies 


Price 7s. 6d. net the pair, suitable for the walls of Baby-Clinics, or for plotting the 
growth of individual babies to mark their progress. 


The following works prepared in the Biometric Laboratory 
can be obtained from H.M. Stationery Office. 
The English Convict, A Statistical Study. By CHartes Gorine, M.D. 


Text. Price 9s. 


Tables of Measurements (printed by Convict-Labour). Price 5s. 


The English Convict. An Abridgment, with an Introduction by Kart Pearson, F.R.S. Price 3s. 
Tables of the Incomplete [-Function. Edited with an Introduction by Kart Pearson, F.R.S. 
Price £2. 2s. Od. or by Post £2. 2s. 9d. 
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NEW ISSUES OF THE GALTON AND BIOMETRIC 
LABORATORIES 


THE TREASURY OF HUMAN INHERITANCE. 


Vo. II. Part II. (Nettleship Memorial Volume.) Colour Blindness. 
By Juxta Bett, M.A., M.R.C.S., L.R.C.P. 143 pp. of Text, Chrono- 
logical Bibliography of 425 titles, Figures of 235 pedigrees on 15 


Plates and frontispiece portrait of John Dalton. Price Forty-five 
shillings net. 


“The name of Edward Nettleship is among the great ones of ophthalmology. He was in the 
succession of Bowman, von Graefe and Donders, but his mind was of a philosophical cast, and he 
was particularly interested in hereditary defects, and comparatively early in life he retired from 
practice in order to devote himself entirely to the study of this subject. His pupils founded a 
medal in his honour, which, at his request, is awarded for the encouragement of scientific and 
ophthalmic work. His best memorial, however, has been erected by Professor Karl Pearson, who 
has devoted one of the fine volumes of 7he Treasury of Human Inheritance to the subject of 
hereditary diseases and anomalies of the eye, to stand as a Nettleship Memorial Volume and 
record of Nettleship’s own work and that of his immediate students and friends. Part I, which 
was published in 1922...contained an account of the life of Nettleship and dealt with the subjects 
of retinitis pigmentosa and allied conditions, congenital stationary night-blindness and glioma 
retinae. Part II, which is now published, is devoted to colour-blindness, and under the capable 
authorship of Dr Julia Beil exhibits to the full the high standard of workmanship that we are 
accustomed to expect of the Cambridge University Press. The volume, indeed, may be considered 
as an dition de luxe, which will give equal satisfaction to the bibliophile and the man of science. 
An excellent reproduction of C. Turner’s engraving of the portrait of Dalton by Lonsdale forms 
an appropriate frontispiece, since Dalton was the first to give a scientific description of colour- 
blindness.”—Lritish Medical Journal. 


“ Diese grossziigige, dem Gediachtnis Nettleships gewidmete Monographie bringt an der Hand 
einer 425 Nummern umfassenden Literatur und der Reproduktion von 235 Stammbaumen, die 
alle naher nach den Originalien erlautert werden, eine ausgezeichnete Darstellung der Erblich- 
keitsverhaltnisse der angeborenen Farbenblindheit. Klar und schén ist auch die einleitende 
historische Darstellung der Lehre vom Farbensinn. Man kann die Verfasserin und den Verlag 
nur begliickwiinschen zu dieser auch in der Form hervorragenden Leistung, der vollstandigsten, 
die unser Fach besitzt und die dies fiir die Erblichkeitslehre so wichtige Kapitel in einer sehr er- 
wiinschten Weise zusammenfasst.’—Prof. AXENFELD in Kdinische Monatsblitter f. Augenheilkunde. 


Shortly: Zhe Right of the Unborn Child. Being a Lecture to London 
School Teachers. By Kart PEARSON, F.R.S. 


Tracts for Computers 


Xi. BIBLIOTHECA TABULARUM MATHEMATICARUM 
being a Descriptive Catalogue of Mathematical Tables. Part I. 
Logarithms of Numbers. By JAMEs HENDERSON, M.A. Double Number. 
Price 9s. net. Just out 

XIV. LOGARITHMETICA BRITANNICA. A Standard Table of 


Logarithms to 20 Decimal Places. By A. J. THompson. Part VIII. 
Numbers 80,000 to 90,000. Shortly 


Cambridge University Press Fetter Lane, E.C. 4 
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AT THE CAMBRIDGE UNIVERSITY PRESS 


THE LIFE, LETTERS, AND LABOURS 
OF FRANCIS GALTON 


By KARL PEARSON, F.R.S. 


GALTON PROFESSOR, UNIVERSITY OF LONDON 


Volume I. Birth 1822 to Marriage 1853. With 5 Pedigree Plates 
& 72 Photographic Plates, Frontispiece & 2 Text-figures 


Price, Bound in Buckram, 36s. net 


“Tt is not too much to say of this book that 
it will never cease to be memorable. Never 
will man hold in his hands a biography 
more careful, more complete.”—The Times 
“A monumental tribute to one of the most 
suggestive and inspiring men of modern 
times.”— Westminster Gazette 


“Tt was certainly fitting that the life of the 
great exponent of heredity should be written 
by his great disciple, and it is gratifying 
indeed to find that he has made of it, what 
may without exaggeration be termed a great 
book.”—Daily Telegraph 


Volume II. Letters and Labours of Middle Life. With 50 Plates 
& many Figures in the Text 


Price, Bound in Buckram, 45s. net 


Cuaprer VIII. Transition Studies: Art 
of Travel, Geography, Climate. 


Cuapter IX. Early Anthropological Re- 


Cuapter XI. Psychological Investiga- 
tions. Transition from Physical to 
Psychical Anthropology. 


searches. Transition from Geography Researches 

and Portraiture. 

CuapTER XIII. Early Statistical Investiga- 
tions with regard to Anthropology. 
Transition to Statistics as funda- 
mental to Biological Enquiry. 


Cuapter X. The Early Study of Heredity: 
Correspondence with Alphonse de 
Candolle and Charles Darwin. 


“For the student of the History of Science, as well as for the student of Galton, this 
volume is of prime importance....... The volume is important and deeply interesting. 
It is splendidly illustrated.””—Glasgow Herald 


“Galton’s personality and achievements have taken their place in the history of 
science, and more than justify the sumptuous ‘Life, Letters, and Labours’ on which 
Professor Pearson has lavished special knowledge and labour.”—The Times Literary 
Supplement 

“Tt is a wholly worthy memorial of a very great man.”—Science 


“We prophesy that Pearson’s Life of Galton will be ranked by our descendants not 
very far behind Boswell’s‘ Johnson,’ and Trevelyan’s‘ Macaulay’.”—British Medical Journal 


“If our race continues to progress in the right direction, our descendants of, say, five 

_or ten centuries hence will be insatiable in their need of information about such men 
as GALTON and DARWIN. They will bless Pearson for his devotion. If the great- 
ness of a man is to be measured by the product of his originality by his energy—and 
this seems the right way of measuring it—GALTON is certainly a very great man and 
his greatness will increase and not decrease as years and centuries go by.”—Isis 
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The new Journal of the Galton Laboratory for National Eugenics, 
size large quarto. Vol. I, Parts I—IV, has appeared : 


ANNALS OF EUGENICS 


A JOURNAL FOR THE SCIENTIFIC 
STUDY OF RACIAL PROBLEMS 
Edited by Kart PEARSON, assisted by ETHEL M. ELDERTON 


I have no Faith in anything short of actual Measurement and the Rule of Three.—cuar.es DARWIN 


ISSUED BY THE GALTON LABORATORY FOR NATIONAL EUGENICS, UNIVERSITY 
OF LONDON, AND PRINTED AT THE UNIVERSITY PRESS, CAMBRIDGE 


VOLUME of the Annals of Eugenics will contain about 400 pages with plates 
A tables. An endeavour will be made to issue a volume annually. 
Papers for publication should be sent to Professor Kart Pearson, University 
College, London, England. It is a condition of publication in the Annals that the 
paper should not already have been issued elsewhere, and will not be reprinted 
without leave of the Editors. Diagrams and drawings should be sent in a state 
suitable for photographic reproduction, and if on decimal paper it should be blue 
ruled, and the lettering only pencilled. Papers will be accepted in French, Italian 


or German. In the last case the manuscript should be in Roman, not German, 


characters. 

Contributors receive 25 copies of their papers free. Fifty additional copies in 
wrapper may be had on payment of 21s. per sheet of eight pages, and 80 copies of 
four pages without wrapper 9s., with an extra charge for Plates. Orders for ad- 
ditional copies should be sent when the final proof is returned. 

The subscription price, payable in advance, is 50s. net per volume (including 
packing and postage). All subscriptions must be prepaid, i.e. paid before issue of 
the first part of a volume. After issue parts can only be obtained at wrapper prices. 
Cheques should be made payable to the Galton Laboratory for National Eugenics, 
they should be crossed ‘“‘ Annals of Eugenics ¢/c,”’ and sent to The Secretary to the 
Galton Laboratory, University College, London, W.C. 1. No foreign cheques can 


be accepted unless they are drawn in sterling, properly stamped, and payable at a 
London agency. 


I desire to subscribe to the 
above journal issued by the Francis Galton Laboratory for National Eugenics at the 
Subscription Price of 50s. 


ADDRESS 


SIGNATURE 
To 'THE SECRETARY TO THE 
GaLton Evernics Lasoratory, UNIvERsIty CoLLEGE, Lonpon, W.C. 1 
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ANNALS OF EUGENICS 


A JOURNAL FOR THE SCIENTIFIC 
STUDY OF RACIAL PROBLEMS | 


EDITED BY 
K. PEARSON and E. M. ELDERTON 


Volume I, Parts ITT & IV. May, 1926. 35/- net (including postage) 


CONTENTS 


I. Studies of Palaeolithic Man. (1) The Chancelade Skull and its Relation to the Modern Eskimo Skull. 
By G. M. Morant, M.Sc. (With three figures in text, ten photographic plates and three tissue contours in 
pocket.) IT. On the Relative Value of the Factors which influence Infant Welfare. An Inquiry by Ernen 
M. Extperron based on Data provided by Dr A. G. AnpErson (M.O.H. Rochdale), Dr W. A. Evans 
(M.O.H. Bradford), Dr A. Greenwoop (M.O.H. Blackburn), Dr H. O. Pinkrneton (M.O.H. Preston) and 
Dr C. H. Tarrersaui (M.O.H. Salford). (With sixty-two diagrams in the text.) Part II (cont.). Factors of 
Infant Viability ; (9) Feeding of the Baby; (10) Storage of Milk, Treatment of Milk, Cleanliness of Bottle 
and Type of Bottle ; (11) Influence of Wage of Father and Regularity of his Employment ; (12) Influence 
of Housing Conditions ; (13) Infant Viability and the Various Factors in all Districts ; (14) Summary of 
Conclusions. ITI. On our present Knowledge of the Relationship of Mind and Body. By Karu Pearson, 
F.R.S. IV. On the Influence of Sanatorium and Dispensary Treatment and Housing Conditions on Pul- 
monary Tuberculosis. By Percy Stocks, M.D., assisted by M. Nort Karn. (With seven figures in the 
text.) (i) Introductory ; (ii) Material ; (iii) Assessment of the Initial Condition ; (iv) Assessment of Progress 
of Patients ; (v) Classification of Treatment ; (vi) Graphic Comparison of Progress of Sanatorium-treated 
and other Cases ; (vii) Comparison of Progress of Sanatorium-treated and other Cases on ‘‘Exposed Risk ” 
Basis ; (viii) Influence of Treatment and Home Conditions on Progress—Correlation Method ; (ix) Con- 


clusions, V. A Pedigree of Paralysis Agitans. By Junia Bewi and A. J. Cuark, M.D. (With one figure 
in the text and two plates.) 


Volume I, Parts 1 & II. October, 1925. 35/- net (including postage) 


CONTENTS 


“ Annals of Eugenics” Portrait Series, No. I. Mautruus, Frontispiece. Foreword, Editorial. I, The 
Problem of Alien Immigration into Great Britain, Illustrated by an Examination of Russian and Polish 
Jewish Children. By Kart Pearson and Margaret Mout. (With fifty diagrams in the text.) Part I. 
(i) Introductory ; (ii) On the Parentage of the Alien Jewish Children ; (iii) On the Racial Homogeneity 
of the Alien Jewish Children ; (iv) Comparative Physique of Alien Jewish and Native Gentile Children ; 
(v) Habits of Alien Jewish and Native Gentile Children. Part II. On the Intelligence of the Alien Jewish 
Children. (A) Introductory ; (B) Relation of Intelligence to Physical Characters; (C) Correlation of 
General Factor of Health and Intelligence ; (D) Intelligence and Pathological Characters. Intelligence and 
Home Conditions ; (E) Results checked by Place in Class; (F) Relative Intelligence of Alien Jewish and 
Native Gentile Children. II. A Pedigree of Epicanthus and Ptosis. By C. H. Usuer. (With diagram 
of fourteen pedigrees and four plates.) III. On the Relative Value of the Factors which influence Infant 
Welfare. An Inquiry by Ernst M. Evperton based on Data provided by Dr A, G. ANpERson (M.O.H. 
Rochdale), Dr Wm Arnotp Evans (M.O.H. Bradford), Dr ALFrep Greenwoop (M.O.H. Blackburn), 
Dr H. O. Pirxineiron (M.O.H. Preston) and Dr C. H. Tarrersati (M.O.H. Salford). I. Preliminary 
Discussion. II. Influence of Environment and Parental Factors on Viability of Baby: (1) Health of 
Parents and Infant Viability ; (2) Habits of Parents and Infant Viability ; (3) Condition of Home and 
Infant Viability ; (4) Occupation of Father and Infant Viability ; (5) Place in Family and Infant Viability ; 
(6) Age of Mother at Birth of Child; (7) Place in Family and Age of Mother as corrections for Infant 
Viability ; (8) Employment of Mother and Infant Viability. IV. A Note on the Correlation between 


Birth- and Death-Rates with reference to Malthus’s Interpretation of their Movements. By Antony B. 
Hitt, B.Se. 


Issued by 
THE GALTON LABORATORY FOR NATIONAL EUGENICS 
University College, London, W.C.1 
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PRESS NOTICES. ANVALS OF EUGENICS, VOL. I. 


“Das Pearsonsche Institut hat der Wissenschaft von der Biologie des Menschen mit der Herausgabe 
dieses Archives ein grosses Geschenk gemacht....Der erste Band enthilt eine Reihe grundlegender 
Arbeiten....Das Archiv ist nicht nur fiir den Anthropologen sondern auch fiir den Sozialhygieniker und 
Wohlfahrtspolitiker von grésstem Wert.”—Mitteilungen der Wiener Anthropologischen Gesellschaft. 


“We wish the new Journal that prosperity which the merit of its first numbers should ensure, and 
heartily congratulate the editors on its form and contents.”—-The Lancet. 


“The history of pious foundations in England tells many stories of endowments successfully used for 
ends of which the founder did not dream; some stories of transformations hard to distinguish from 
breaches of trust; not very many where, although the original intentions of the founder are faithfully 
carried out, they fulfil an even more useful purpose than he had contemplated. In this select class Francis 
Galton’s endowment has an honourable place...there is an obvious advantage in having a journal wholly 
devoted to the scientific study of racial problems. This fact has led Professor Karl Pearson to undertake 
the publication of a new Journal, the Annals of Hugenics, the first two parts of which have recently 
appeared. The format of the new Journal is, like that of all books printed at the Cambridge University 
Press, admirable and the size of page is suitable for the reproduction of charts and pedigrees....We are 
sure that this new Journal will receive a hearty welcome, and offer the editor and his collaborators our 
sincere good wishes for the success of the undertaking.—British Medical Journal. 


Annals of Eugenics. Vol. II. Parts I and II now at press will shortly be issued. Price 35s. net. 
Subscription Price before issue 50s. per volume net. 


At the Biometric Laboratory, University College, London. 


TABLES FOR STATISTICIANS AND 


BIOMETRICIANS. | Edited by KARL PEARSON, F.R.S. 


The new edition of this book will consist of two Parts 


Part I embraces the First Edition carefully revised. It has now been issued 
Price 15s. net, plus 1s. postage to any address, and may be obtained direct from the 
Biometric Laboratory, University College, London, or through any bookseller. 


Part II will contain all the Tables issued in Biometrika during the last ten years 


together with a number of Tables not yet published, but at present being computed. 
It is hoped to issue it this year. 


PRESS NOTICES OF THE FIRST EDITION 


“To the workers in the difficult field of higher statistics such aids are invaluable. Their calculation and 
publication was therefore as inevitable as the steady progress of a method which brings within grip of mathe- 
matical analysis the highly variable data of biological observation. The immediate cause for congratulation is, 
therefore, not that the tables have been done but that they have been done so well The volume is in- 
dispensable to all who are engaged in serious statistical work.” —Science 


“The whole work is an eloquent testimony to the self-effacing labour of a body of men and women who 
desire to save their fellow scientists from a great deal of irksome arithmetic; and the total time that will be 
saved in the future by the publication of this work is, of course, incalculable To the statistician these 
tables will be indispensable.” —Fournal of Education 


“The issue of these tables is a natural outcome of Professor Karl Pearson’s work, and apart from their 
value for those for whose use they have been prepared, their assemblage in one volume marks an interesting 


stage in the progress of scientific method, as indicating the number and importance of the calculations which 
they are designed to facilitate.” —Post Magazine 
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BIOMETRIKA. ‘Vol. XVIII, Parts Ill and IV 


CONTENTS 


PAGE 
“ Biometrika ” Portrait Series, No, IV. 1874— 


1926... : Frontispiece 
I. On the Skull and Portraits of Gaeas Bdhanan: By Kant PEARSON, F.R.S, 

(With thirty-two pee. five illustrations in the text and three tissues in 


II. Further Notes on By J. Ph. D. 257 


. On the Probable Error of the Mode of Skew Frequency Distributions. By 
Professor KazuTaro M.Sc. two and seven 
diagrams in the text) . : . 263 


Ueber die Anwendung der («Variate Difference 
Method”) bei Reihenausgleichungen, Stabilititsuntersuchungen und 


Korrelationsmessungen. Erster Theil. By Professor OskAR ea 
- Varna . ; 


V. On the Means and Squared Standard- Devintiows of Small Samples from any 


Population. By A. E. R. M.A. nine in the 


VI. Table of the Ratio: Area to Bounding Ordinate, for ei Pee of the 
Normal Curve. By P. 


VII. On the Correlation of the Mean and the Vacianosi in os ice from 
an “Infinite” Population. J. Ph.D., two 
figures in the text) : : 


Miscellanea: 


The Use of Biometric Methods applied to Craniology. Review of “ A Study 
of the Hokien and the Tamil Skull. By Professor Gorpon ae 
Singapore.” By G. M. Moranr . 


= 

The publication of a paper in Biometrika marks that in the Editors’ opinion it contains either in 
method or material something of interest to Biometricians. But the Editors desire it to be distinctly 
understood that such publication does not mark assent to the arguments used or to the conclusions 
drawn in the paper. 

A volume of Biometrika containing about 400 pages, with plates and tables, is issued annually. 

Papers for publication and books and offprints for notice should be sent to Professor Kart Prarson, 
University College, London. It is a condition of publication in Biometrika that the paper shall not 
already have been issued elsewhere, and will not be reprinted without leave of the Editors. It is 
very desirable that a copy of all measurements made, not necessarily for publication, should accom- 
pany each manuscript. In all cases the papers themselves should contain not only the calculated 
constants, but the distributions from which they have been deduced. Diagrams and drawings should be 
sent in a state suitable for direct photographic reproduction, and if on decimal paper it should be blue 
ruled, and the lettering only pencilled. 

Papers will be accepted in French, Italian or German. — In the last case the manuscript should be 
in Roman not German characters. Russian contributors may use Russian bat their papers will be 
translated into English before publication. 

Contributors receive 25 copies of their papers free. Fifty additional copies may be had on 
payment of 17/- per sheet of eight pages, or part of a sheet of eight pages, with an extra charge for 
Plates; these should be ordered when the final proof is returned. 

The subscription price, payable in advance, is 44s, net per volume (including packing and postage) : 
single numbers 16s. net (including postage). Owing to the scarcity of early volumes, the following rates 
must now be charged for complete sets. Vols.[to XVIII: Bound in buckram £63. 10s., in wrappers £56 net. 
Recent volumes may still be obtained at wrapper prices. Standard buckram cases with Darwin block, 
price 3s. 6d. +6d. postage per volume. Index to Vols. I to V, 2s. net. Index to Vols. I to XV, 7s. 6d. net. 
Cheques must be made payable to Professor Karl Pearson and sent to the Secretary, Biometric ‘Laboratory, 
University College, London, W.C.1, to whom all orders for series and single copies should be addressed. 
All cheques must be properly stamped and should be crossed ‘‘ Biometrika Account.” 
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Contents of Part I. October 1926. Price 10s net 
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(WITH 31 TEXT-FIGURES). 
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SYDNEY SMITH, M.D., D.P.H., anpD BORIS BOULGAKOW, M.D. A Case of Dicephalis, Dibrachius, Dipus 
showing certain features of Embryological interest (WITH 7 TEXT-FIGURES). 


SYDNEY SMITH, M.D., D.P.H., anp BORIS BOULGAKOW, M.D. A Case of Cyclopia (w1TH 5 TEXT-FIGURES), 


H. LEIGHTON KESTEVEN, D.Sc., M.D., Ca.M. The Homology of the Ala Temporalis and of the Alisphenoid Bone 
(WITH 20 TE:T-FIGURES). 


J. ERNEST FRAZER, F.R.C.S. The Disappearance of the Precervical Sinus (WITH PLATE I AND § TEXT-FIGURES), 


CHARLES C. MACKLIN anp MADGE THURLOW MACKLIN. Is the Mingazzini Phenomenon, in the Villus of 
the Small Intestine, an Evidence of Absorption ? (WITH PLATE 1). 
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PROFESSOR DR EMIL ABERHALDEN. Handbuch der biologischen Arbeitsmethoden. 
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